INTRODUCTION


The following account of wave mechanics is based on a series of eight 
lectures delivered by me early in 1940, as Sukhraj Rai Reader in Natural 
Science of the Patna University. It is too sketchy and succinct to be a text- 
book or a treatise and, when I undertook to write this, it was for the compel- 
ling reason that the Readership lectures have to be published. There is no 
dearth of books and papers on the subject, written in English and American, 
even by the masters who have invented the theory and applied it to pheno- 
mena physical, chemical and astronomical. Most of these authoritative ex- 
positions of wave mechanics are meant for the specialist and the advanced 
research worker and the average M.Sc. student of an Indian University is 
unable to follow them for lack of adequate mathematical attainment. It is 
this consideration which has largely influenced the present treatment. A new 
plan of presentation has been resorted to and in the execution of it lies the 
merit of the book, if there be any. 

From the Poisson brackets to non-commutative relations is a very wide 
gulf and the theoretical structure that runs across it has been built up part 
by part by Bohr and Sommerfeld, de Broglie and Schrödinger, Heisenberg
and Pauli, Born and Dirac and several others. The many phases through 
which the quantum theory passed within a few years of this century have
left some indelible marks on the final form of the theory. Thus, in the most
general theory occur terms and expressions which are described as Hamil- 
tonians, wave-packets and observables. A full understanding of the Dirac 
phase of the theory is not therefore possible without a knowledge of the 
theoretical foundations of each earlier phase of quantum physics. Hence it 
is in the fitness of things that one should start with the shortcomings of the
classical theory and climb up, till one reaches the more sophisticated compli- 
cations of the relativistic quantum mechanics, along the ladder of historical 
development. 


More sophisticated, the complications produce genuine theoretical difficulties.
The problems of matter and radiation become inextricably mixed
up near the “edge” of the electron mainly on account of interaction
effects and the production of electron pairs. “Inside the electron” is the land
of mystery wherein is probably treasured the secret of the electronic
self-energy and of the constants e and h. Here are the frontiers of theoretical
research in physics and before reaching them we have to pass in review
Maxwell’s equations, Bohr’s selection principles, Einstein’s photon-hypothesis,
and the quantum theory of radiation, distinguished from the earlier and
unsuccessful elaboration of Schrödinger’s theory to account for the dual nature
of light. To make the outline of quantum mechanics complete it is therefore 
desirable to include an account of radiation from Maxwell to Heisenberg 
and Dirac. Further reasoning is not needed to justify the inclusion of the 
theory of radiation here. 

The field of application of wave mechanics is very wide, so only a few of the
most prominent applications are touched upon here. The future of a theory
depends upon its inner logic, the present upon its phenomenological success.
For this reason, and also to bring out the connection between mathematical 
abstractions and the observable properties of matter, I have thought it neces- 
sary to give a few applications. 

The main business of a scientific philosophy is to clarify thought. 
Words fail to explain the correlation of quantum phenomena which is express- 
ed by symbols. Yet the phenomena themselves are describable by words ! 
It is imperative, therefore, to attempt to harmonise words and symbols in 
order to produce a clear understanding of the phenomena and their correla- 
tion. In the course of our exposition fundamental questions regarding causa- 
lity, the nature of light, probability, quantum forces, etc. have been examined. 
But in doing so we have taken special care to keep all metaphysics out. For 
the treatment of such fundamental questions in relation to science is the 
subject proper of the philosophy of science ; not of metaphysics. 

The basic principles of wave mechanics have not been formulated under 
theological inspiration as the principles of classical mechanics were. This, in 
itself, is a notable fact proclaiming the advent of a more rational age. The
concept of the de Broglie wave was not suggested by a text in the New
Testament or the Old. Nothing has inspired the theoretical worker more 
in this pioneer work than the patient and skilled experimentation in the 
laboratory. Nothing has affected the course of evolution of physics more
effectively than the progressive development of the experimental technique. 
Observation demands a new theory ; the new theory, when it comes, predicts 
the result of more penetrating observation ; the result springs a surprise on 
the scientist and calls for a more general theory. And so rapidly has the 
wheel turned in the domain of quantum phenomena that at times only a 
few months separated the first intimation of the failure of a theory from
the dawn of another. These considerations have persuaded me to emphasize,
wherever occasion arises, the part played by phenomena in shaping the growth
of ideas and theories.

Forbidding, no doubt, is the appearance of the symbolism of wave
mechanics. Nevertheless the mathematical operations themselves are extremely
simple and straightforward; the methods have a tremendous power;
the logical structure has a beauty of its own. On the formal side of the theory
it is purely a calculus of states and observables. In practice one is concerned
with the solution of certain differential equations or with properties of groups.
The three varieties of mathematics are introduced in the following pages. 

During the months that occupied the writing, whenever I got stuck in 
my work, I was reminded of the last proposition of Wittgenstein’s Tractatus
Logico-Philosophicus, viz.,

“Whereof one cannot speak, thereof one must be silent”. 

The critical philosopher would be silent. The speculative philosopher might 
wax mathematical. There is no other go because “ whatever can be said, 
can be said clearly and whatever cannot be said cannot be said”. The 
reader is well-advised to bear in mind these remarks or, rather, propositions 
of Wittgenstein while undertaking a serious study of the subject. The author, 
on his part, has found that when it comes to putting the ideas of modern 
physics in circulation words are counterfeit coins, symbols are genuine ones. 
On the other hand, the sense data alone are real and they are describable by 
words while the rest of the theory, including even the familiar concepts of 
mass and force, is logical fiction. There is no contradiction here between 
the last statement and the earlier. One has simply to be alive to the fact 
that the sensory experiences that are best described by words are best cor- 
related by symbols. 

There was a time when the term “wave mechanics” was reserved for
the de Broglie-Schrödinger theory giving wave representations of the position
and momentum of a particle. Wave mechanics was thus distinguished
from “quantum mechanics” or the theory of non-commutative observables
and also from the “quantum theory” of Bohr, Sommerfeld and others which
was based on classical principles and classically expressed quantum conditions.
We use the term “wave mechanics” in the most general sense, as
Frenkel has done. It is perfectly legitimate to bring the abstract algebra
under wave mechanics since the symbolic operators are linked to experiment- 
ally verifiable results only through the eigenstates or wave functions. 

One word more before we turn to the march of ideas through the inevit- 
able jungle of symbols. I have to thank the Senate of the Patna University 
as much for this honour of Readership as for the opportunity to collect 
thoughts on this subject. I feel particularly grateful to Prin. Kanta Prasad
and Prof. D. N. Sen for their unstinted help to make these lectures a success.
It is also a pleasure to acknowledge my indebtedness to the Registrar of the
University and the Manager of the Karnatak Printing Press, Bombay for
their unfailing kindness, courtesy and efficiency in the matter of printing this
book. 
1. Postulates.
2. Principles.
3. Hamiltonian Systems.
4. Poisson Brackets.
5. Conditionally Periodic Systems.
6. Degeneracy.
7. Relativistic Expressions.
8. The Failure of Classical Mechanics.

1. POSTULATES 

The two basic postulates upon which classical mechanics is built up are 
the following : 

(A) The physical universe is describable in terms of an absolute Eucli- 
dean space and an absolute time. 

(B) The events of the physical universe are governed by the law of 
causation. 

There is a conventional meaning attached to the statements (A) and 
(B). In the treatment of any problem a fixed frame of reference O-xyz is 
chosen with respect to which the co-ordinates of material objects are specified. 
The origin is a fixed point of space and the directions of the axes are fixed
with reference to the fixed stars. In the final calculations it is only the
relative co-ordinates with respect to some real object that enter and the position
of the fictitious fixed origin drops out of the conclusion. The absolute
time is measured by the rotation of the earth about its axis relative to the 
fixed stars. In the second postulate appears the much discussed phrase “ the 
law of causation ”. We are not concerned here with its most general mean- 
ing ; what it means in mechanics is all that matters. Every physical system 
is characterised by certain parameters ; and, according to the law of causa- 
tion, the values of the parameters at one instant lead to a unique set of 
values at any later instant by virtue of the laws of motion. It is implied 
here that a state of the physical system is completely known when the values
of the parameters for that state are obtained. There are three laws of motion
given by Newton which are unverifiable and, therefore, of an axiomatic character.
We give below the authoritative statement of these laws from Thomson
and Tait. 1 

“First Law: Every body continues in its state of rest or of uniform
motion in a straight line, except in so far as it may be compelled by force
to change that state.

Second Law: Change of motion is proportional to force applied, and
takes place in the direction of the straight line in which the force acts.

Third Law: To every action there is always an equal and contrary
reaction ; or, the mutual actions of any two bodies are always equal and
oppositely directed.”

Consider the motion of a particle of mass m, moving along Ox under the
action of a force F. We have the familiar equation,

$$ m\ddot{x} = F. \qquad (1) $$

When F = 0, $\dot{x}$ is constant and the first law is satisfied. In the second
law there is the phrase ‘change of motion’ which really stands for the
time-rate of change of $(m\dot{x})$ and so (1) illustrates the second law. If we look
upon $(m\ddot{x})$ as the kinematical reaction 2 to the external force F, then the third
law also is found to be illustrated by (1).

Two new concepts are introduced by the three laws ; one is that of 
kinematical mass and the other, that of force. It is necessary to distinguish 
the kinematical mass from the gravitational mass featuring in Newton's fourth 
law, the law of universal gravitation. The former is a measure of the kine- 
matical response which a body gives to the force while the latter determines 
the gravitational pull that a body exercises on another. An exact statement 3 
of the law of gravitation is appended for reference : 

“ Every particle of matter in the universe attracts every other 
particle, with a force whose direction is that of the line joining the 
two, and whose magnitude is directly as the product of their masses, 
and inversely as the square of the distance from each other.”

Thought-experiments can be devised to determine the kinematical and gra- 
vitational masses of bodies. There is no a priori reason why the two should 
be equal for bodies. Yet they are found to be equal 4 and that leads to a
remarkable simplification of the theory. 

Force is a logical fiction invented to explain different states of mind 
and matter. In the sense in which the term is used in classical mechanics 
it brings out the inter-connectedness of the components of a system and ex- 
plains the changes in configuration. The forces of action and reaction are ex- 
change relations between the components of a system or between the system 
and some external agency. When the external agent is absent or ineffective
the system satisfies one or more of the laws of isolation which are the principles
of conservation of linear and angular momentum and energy.

The laws of motion are enunciated for structureless bodies or idealised 
particles. The problem of n bodies is formulated by superposing linearly the
equations of equilibrium of all. The procedure was explicitly set out for
the first time by d'Alembert. 5 The treatment of complicated problems is
considerably simplified by d'Alembert's principle 6 although its essence is
contained in the three laws of motion.
With the weight of accumulated knowledge physical science has burst
through both the postulates (A) and (B) ; and wider limits control it on
the separate fronts of general relativity and quantum mechanics. The new
mathematics of q's and p's transcends the orthodox mathematical limitations
of the law of causation and of the absolute Euclidean space and time. 

Some points of the inner structure of classical mechanics deserve notice. 
In the first place, the first law of motion cannot be fully reconciled with (A). 
The state of rest and that of uniform motion are kinematically distinct but 
dynamically indistinguishable. Secondly, no explanation is given, as we have
already stated above, of why the kinematical and gravitational masses are equal.
The first of these points has led to special relativity while the other constitutes 
the basis of general relativity. According to the special theory there is no 
real distinction between the state of rest and that of uniform motion. The 
general theory goes a step further and liquidates the distinction between the 
two states and that of acceleration so that there is no need to distinguish the 
kinematical mass from the gravitational. At the cost of (A) mechanics and 
gravitation have thus been welded together and a new correlation — the cor- 
relation of field laws— between events is set up other than (B). 

There are certain other unsatisfactory features, concealed in the postu- 
lates of classical mechanics which have been brought to light by the diffi- 
culties of quantum mechanics. Newton’s laws imply the existence of (1) 
structureless bodies or particles, (2) distinguishable particles and (3) velocity 
at an instant for a particle. The assumptions were made for mathematical 
simplification. The first two were accepted because for a long time no phy- 
sical situation was known that could challenge them ; the third found favour 
because the rigorous treatment of limits was unknown in the early days of
classical mechanics. 

Given a large assembly of similar particles 6 which are in rapid motion 
and whose mean free path is sufficiently small it becomes practically impossible
to determine the average velocity of a particular particle since there
is nothing to decide whether the particles seen in two different positions at 
two different instants are different or not. Even if the practical aspects are 
set aside the theoretical question remains as to whether the laws regarding 
the different states of the assembly can be deduced from the classical laws 
of motion. This question has been discussed by Mach 7 and others and it is
found that the required statistical laws cannot be deduced from the classical
laws on the assumption that each particle is distinguishable from another. 
Quantum mechanics is generalised enough to treat such cases. 

The difficulty caused by the assumption of structureless bodies persists
even in quantum mechanics and the problem of the electronic self-energy
and mass still defies solution.

The logical fiction of velocity at an instant is retained even in quantum
mechanics but, as we will show later, the uncertainty principle makes up for it.
It was Zeno 8 who first propounded the paradox of the arrow. Suppose that an
arrow is in motion at A at some instant ; then it means that after a time it
will be at some other point B. But we can also think of the arrow stopping
at some intermediate point between A and B. Hence the original assumption
that the arrow can go to some point B, no matter how near it is to A, is
unwarranted and therefore wrong. From this it was concluded that the arrow
cannot be in motion at A. The solution of the difficulty is as follows.
When we see a body moving the impression of motion is cinematographic. 
What is actually observed is a body in one place at one instant, in another
place at another instant, in an intermediate place at an intermediate instant 
and so on. In an ideal case the situation can be best described by a simple 
functional relation, 

$$ s = f(t) \qquad (2) $$
between the position co-ordinate s and the time co-ordinate t. To say on ob- 
servational evidence that s is a continuous function of t is unjustifiable since 
this cannot be decided by observation. But even if this is permitted for 
mathematical simplicity we cannot say that ds/dt exists at a particular instant. 
The average velocity between two neighbouring epochs is observationally real
but the limit of the average velocity, ds/dt, is observationally unreal and is 
a logical fiction. Thus it is possible for bodies to change their places with 
time so as to produce the false sensory impression that there is a continuous 
displacement and that a velocity exists at each instant. 9
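The distinction drawn above between the average velocity over a finite interval and the limit ds/dt can be illustrated numerically. The following is only a sketch: the relation s = f(t) = 4.9 t² and the chosen intervals are assumptions made for illustration and are not taken from the text.

```python
def f(t):
    """Illustrative position function s = f(t); an assumption for this sketch."""
    return 4.9 * t * t


def average_velocity(t0, dt):
    """Average velocity between the epochs t0 and t0 + dt."""
    return (f(t0 + dt) - f(t0)) / dt


t0 = 1.0
intervals = [1.0, 0.1, 0.01, 0.001]
averages = [average_velocity(t0, dt) for dt in intervals]

for dt, v in zip(intervals, averages):
    print(f"interval {dt:>6}: average velocity {v:.4f}")
```

Each entry of `averages` corresponds to a pair of epochs that could in principle be observed; the value towards which they tend as the interval shrinks is the quantity ds/dt, which is never itself the result of any single pair of observations.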

So these are some of the considerations which, as will be found later, 
make the formulation of quantum mechanics on the basis of (A) and (B) 
impossible.

It is safe to divide the domain of scientific phenomena into three compartments,
of which two are already known as microscopic and macroscopic. 10
The former is the domain of quantum laws while the latter presents
large-scale phenomena subject to the field laws of general relativity. Between the
two is the third compartment, the buffer state, which may as well be given
the name ‘Mecroscopic’. This is the domain of classical mechanics. As
we have already remarked above certain generalizations of the classical theory 
have led to general relativity ; other generalizations to quantum mechanics. 
The units of mass, length and time which characterize the three domains may
be stated here for comparison.

               M                L               T

Microscopic    $10^{-38}$ gm    $10^{-8}$ cm    $10^{-17}$ sec
Mecroscopic    1                1               1
Macroscopic    $10^{33}$        $10^{17}$       $10^{8}$

To produce a theory of mechanics valid for the electron as well as the spiral
nebula is the dream of the theoretical physicist. The task is fraught with
many difficulties. The classical mechanics fails inside the atom. The quantum
mechanics fails “inside the electron”.
2. PRINCIPLES

Let us take a dynamical system consisting of n particles and write down
the equations of motion of any one of them as

$$ m\ddot{x} = X + X', \quad m\ddot{y} = Y + Y', \quad m\ddot{z} = Z + Z', \qquad (3) $$

where X, Y, Z stand for the externally impressed force while X', Y', Z' are the
components of the action on m by the surrounding particles. The equations
(3) give the equilibrium between the kinematical reaction $(m\ddot{x}, m\ddot{y}, m\ddot{z})$, the
external force and the action of the surrounding particles. Similarly, any
other particle also is a seat of equilibrium of forces. Hence the joint system
of n particles must be in equilibrium in this sense. It follows that the six
equations of statical equilibrium of forces can be written for the system. In
doing this we need only note that the relevant force at (x, y, z) has the
components $(m\ddot{x} - X,\ m\ddot{y} - Y,\ m\ddot{z} - Z)$ since, in the joint system, all the equal
and opposite forces of the nature of X', Y', Z' drop out. The equations of
statical equilibrium give

$$ \sum m\ddot{x} = \sum X, \quad \sum m\ddot{y} = \sum Y, \quad \sum m\ddot{z} = \sum Z, $$
$$ \sum m(y\ddot{z} - z\ddot{y}) = \sum (yZ - zY), \quad \sum m(z\ddot{x} - x\ddot{z}) = \sum (zX - xZ), \qquad (4) $$
$$ \sum m(x\ddot{y} - y\ddot{x}) = \sum (xY - yX). $$

In the derivation of (4) we have stuck to the terminology of Newton’s laws
to convince the reader that it is not necessary to invoke d’Alembert’s principle
which is implicit in them. The linear momentum along Ox, $\sum m\dot{x}$, is conserved
when $\sum X = 0$. The angular momentum about Ox, $\sum m(y\dot{z} - z\dot{y})$, is also
conserved when the moment of forces $\sum (yZ - zY)$ vanishes. The two principles
of conservation of linear and angular momentum follow from (4). If we
multiply the three equations (3) by $\dot{x}, \dot{y}, \dot{z}$ respectively and add and then
combine the n equations so formed for the system,

$$ \sum m(\dot{x}\ddot{x} + \dot{y}\ddot{y} + \dot{z}\ddot{z}) = \sum (X\dot{x} + Y\dot{y} + Z\dot{z}) + \sum (X'\dot{x} + Y'\dot{y} + Z'\dot{z}). \qquad (5) $$

In the summation,

$$ \sum (X'\dot{x} + Y'\dot{y} + Z'\dot{z}) = 0, \qquad (6) $$

since equal and opposite forces of action and reaction give corresponding to
every term of the form $(X'\dot{x} + Y'\dot{y} + Z'\dot{z})$ another, viz., $-(X'\dot{x} + Y'\dot{y} + Z'\dot{z})$.
Therefore,

$$ \frac{d}{dt} \sum \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) = \sum (X\dot{x} + Y\dot{y} + Z\dot{z}). $$

When the forces are conservative, that is, when a function V(x, y, z) exists
such that

$$ X = -\frac{\partial V}{\partial x}, \quad Y = -\frac{\partial V}{\partial y}, \quad Z = -\frac{\partial V}{\partial z}, $$

we have the integral of energy,

$$ \sum \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) - \sum \int (X\,dx + Y\,dy + Z\,dz) = \text{const.} \qquad (7) $$

or, in the usual notation of the kinetic and potential energy,

$$ T + V = \text{constant}. \qquad (8) $$

This is one form of the principle of energy. 
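The conservation principles of this section can be checked on a small numerical model. The sketch below is an illustration under stated assumptions, not part of the original treatment: two particles in a plane, with no external force, act on each other through a hypothetical spring of constant `k` and natural length `r0`, so that action and reaction are equal, opposite and directed along the line joining them. The masses, initial values and the velocity-Verlet stepping scheme are likewise choices made for the example.

```python
import math


def run(steps=2000, dt=1e-3):
    m = [1.0, 2.0]                  # masses (illustrative values)
    k, r0 = 5.0, 1.0                # assumed spring constant and natural length
    pos = [[0.0, 0.0], [1.5, 0.0]]  # initial positions in the plane
    vel = [[0.0, 0.4], [0.3, -0.1]]

    def separation():
        dx = pos[1][0] - pos[0][0]
        dy = pos[1][1] - pos[0][1]
        return dx, dy, math.hypot(dx, dy)

    def forces():
        dx, dy, r = separation()
        f = k * (r - r0)                 # tension along the joining line
        fx, fy = f * dx / r, f * dy / r
        return [[fx, fy], [-fx, -fy]]    # action and reaction

    def invariants():
        px = sum(m[i] * vel[i][0] for i in range(2))
        py = sum(m[i] * vel[i][1] for i in range(2))
        lz = sum(m[i] * (pos[i][0] * vel[i][1] - pos[i][1] * vel[i][0])
                 for i in range(2))
        kinetic = sum(0.5 * m[i] * (vel[i][0] ** 2 + vel[i][1] ** 2)
                      for i in range(2))
        potential = 0.5 * k * (separation()[2] - r0) ** 2
        return px, py, lz, kinetic + potential

    before = invariants()
    f = forces()
    for _ in range(steps):
        for i in range(2):
            for c in range(2):
                vel[i][c] += 0.5 * dt * f[i][c] / m[i]  # half step in velocity
                pos[i][c] += dt * vel[i][c]             # full step in position
        f = forces()
        for i in range(2):
            for c in range(2):
                vel[i][c] += 0.5 * dt * f[i][c] / m[i]  # second half step
    return before, invariants()


before, after = run()
print("linear momentum, angular momentum, T + V before:", before)
print("linear momentum, angular momentum, T + V after: ", after)
```

In this model the two components of linear momentum and the angular momentum about the origin should remain unchanged to rounding error, while T + V, with V taken as the potential energy of the assumed spring, should remain constant to within the small error of the stepping scheme.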


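Suppose that the system of n particles has actually only s degrees of
freedom. It may be assumed that s independent co-ordinates $q_1, q_2, \ldots, q_s$
specify the configuration at any time t. The co-ordinates of a typical particle
of mass m, viz. x, y, z can be taken as functions of the s co-ordinates and the
time t. It is clear that

$$ \dot{x} = \frac{\partial x}{\partial t} + \sum_j \frac{\partial x}{\partial q_j}\dot{q}_j, \quad \frac{\partial \dot{x}}{\partial \dot{q}_i} = \frac{\partial x}{\partial q_i}, \quad \frac{d}{dt}\left(\frac{\partial x}{\partial q_i}\right) = \frac{\partial \dot{x}}{\partial q_i}. \qquad (9) $$

We get from (3)

$$ \sum m\left(\ddot{x}\frac{\partial x}{\partial q_i} + \ddot{y}\frac{\partial y}{\partial q_i} + \ddot{z}\frac{\partial z}{\partial q_i}\right) = \sum \left\{(X + X')\frac{\partial x}{\partial q_i} + (Y + Y')\frac{\partial y}{\partial q_i} + (Z + Z')\frac{\partial z}{\partial q_i}\right\} \qquad (10) $$
$$ = Q_i, \qquad (11) $$

$Q_i$ being defined in this manner. Now

$$ \sum m\left(\ddot{x}\frac{\partial x}{\partial q_i} + \ddot{y}\frac{\partial y}{\partial q_i} + \ddot{z}\frac{\partial z}{\partial q_i}\right) = \frac{d}{dt}\sum m\left(\dot{x}\frac{\partial x}{\partial q_i} + \dot{y}\frac{\partial y}{\partial q_i} + \dot{z}\frac{\partial z}{\partial q_i}\right) - \sum m\left\{\dot{x}\frac{d}{dt}\left(\frac{\partial x}{\partial q_i}\right) + \dot{y}\frac{d}{dt}\left(\frac{\partial y}{\partial q_i}\right) + \dot{z}\frac{d}{dt}\left(\frac{\partial z}{\partial q_i}\right)\right\} $$
$$ = \frac{d}{dt}\frac{\partial}{\partial \dot{q}_i}\sum \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) - \frac{\partial}{\partial q_i}\sum \tfrac{1}{2} m(\dot{x}^2 + \dot{y}^2 + \dot{z}^2) $$
$$ = \frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_i}\right) - \frac{\partial T}{\partial q_i}. \qquad (12) $$

There are the s Lagrangian equations of the type,

$$ \frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}}\right) - \frac{\partial T}{\partial q} = Q. \qquad (13) $$

When the potential function exists, $Q = -\partial V/\partial q$, as considerations of work for
virtual displacements $\delta q$ show immediately.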
Hence

$$ \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}}\right) - \frac{\partial L}{\partial q} = 0, \qquad (14) $$

where L = T - V is called the Lagrangian of the system. The Lagrangian
equations 11 are valid for more general cases also. They can be deduced by
requiring the integral

$$ \int_{t_1}^{t_2} L\,dt $$

to be stationary, the terminal values of q and t being assigned. The whole
science of dynamics can thus be based on the stationary character of a certain
integral,

$$ \delta \int_{t_1}^{t_2} L\,dt = 0 ; \qquad (15) $$

that is, given any arbitrary portion of the trajectory in the q-space if any
other path is considered with the same terminal values of q and t the integral
over the actual path is either a maximum or a minimum. This is Hamilton's
principle. It follows that

$$ \delta \int_{t_1}^{t_2} L\,dt = \left[\sum_i \frac{\partial L}{\partial \dot{q}_i}\,\delta q_i\right]_{t_1}^{t_2} + \int_{t_1}^{t_2} \sum_i \left\{\frac{\partial L}{\partial q_i} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right)\right\}\delta q_i\,dt. \qquad (16) $$

The terminal conditions give the first expression as zero. Moreover, since
$\delta q_i$ are arbitrary we have by Hamilton’s principle,

$$ \frac{\partial L}{\partial q_i} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) = 0, \qquad (17) $$

which are Lagrange’s equations. The principle of least action,

$$ \delta \int_{t_1}^{t_2} 2T\,dt = 0, \qquad (18) $$

can be deduced from Hamilton’s principle for natural dynamical systems
for which T and V do not contain t explicitly, the energy constant being the
same for the actual as well as the varied paths. Let

$$ \int_{t_1}^{t_2} L\,dt = W $$

and

$$ T + V = \alpha_1 $$

so that

$$ W = S - \alpha_1 (t_2 - t_1). $$

W is Hamilton’s function while S is the action function. It is obvious under
the conditions of variation that if $\delta W = 0$, $\delta S = 0$ also. When T = 0,
Hamilton’s principle reduces to the principle on which statics is based, viz.,

$$ \delta \int V\,dt = \delta V \int dt = 0 \quad \text{or} \quad \delta V = 0. $$
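Hamilton's principle (15) lends itself to a simple numerical illustration. The sketch below makes several assumptions not found in the text: a particle of unit mass moving vertically under gravity with L = ½ẏ² − gy, terminal values y = 0 at t = 0 and t = 1, the time integral replaced by a sum over `N` short segments, and a varied path obtained by adding `eps * sin(pi t / T)` to the actual parabolic path so that the terminal values are unchanged.

```python
import math

g, T_END, N = 9.8, 1.0, 1000   # assumed g, time span and number of segments
dt = T_END / N


def path(eps):
    """Actual parabola y(t) plus a variation that vanishes at both ends."""
    ys = []
    for k in range(N + 1):
        t = k * dt
        y_true = 0.5 * g * t * (T_END - t)   # satisfies y'' = -g, y(0) = y(T) = 0
        ys.append(y_true + eps * math.sin(math.pi * t / T_END))
    return ys


def action(ys):
    """Discretised integral of L = T - V = (1/2) ydot^2 - g y for unit mass."""
    total = 0.0
    for k in range(N):
        ydot = (ys[k + 1] - ys[k]) / dt
        ymid = 0.5 * (ys[k] + ys[k + 1])
        total += (0.5 * ydot ** 2 - g * ymid) * dt
    return total


w0 = action(path(0.0))
w_plus = action(path(0.1))
w_minus = action(path(-0.1))
print("actual path:", w0)
print("varied paths:", w_plus, w_minus)
```

For this particular problem the integral over the actual path comes out smaller than over either varied path, and the two varied paths give the same value, which is what the vanishing of the first-order variation requires.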
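3. HAMILTONIAN SYSTEMS

The Hamiltonian equations 12 can also be deduced from the principle,

$$ \delta \int_{t_1}^{t_2} L\,dt = 0. \qquad (19) $$

Consider the expression,

$$ \sum p\dot{q} - L(q, \dot{q}, t) \qquad (20) $$

out of which $\dot{q}$ may be eliminated by using the relation,

$$ p = \frac{\partial L}{\partial \dot{q}} \qquad (21) $$

defining the generalized momentum p. Let

$$ H(p, q, t) = \sum p\dot{q} - L(q, \dot{q}, t). \qquad (22) $$

Hamilton’s principle may now be expressed in the form

$$ \delta \int_{t_1}^{t_2} \left[-H + \sum p\dot{q}\right] dt = 0, \qquad (23) $$

that is,

$$ \int_{t_1}^{t_2} \sum \left[-\left(\frac{\partial H}{\partial q} + \dot{p}\right)\delta q + \left(\dot{q} - \frac{\partial H}{\partial p}\right)\delta p\right] dt = 0. \qquad (24) $$

It follows that, as terminal values of $\delta q$ are zero,

$$ \frac{\partial H}{\partial q} = -\dot{p}, \quad \frac{\partial H}{\partial p} = \dot{q}. \qquad (25) $$

These are the canonical equations of Hamilton.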
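The canonical equations (25) can be integrated step by step once H is given. The following sketch assumes, purely for illustration, the Hamiltonian H = ½p² + ½q² of a harmonic oscillator of unit mass and unit frequency; the partial derivatives are taken by central differences and the stepping is a leapfrog scheme, which is suitable here because this H is the sum of a function of p and a function of q.

```python
import math


def H(q, p):
    """Assumed Hamiltonian of a unit-mass, unit-frequency oscillator."""
    return 0.5 * p * p + 0.5 * q * q


def dH_dq(q, p, h=1e-6):
    """Central-difference estimate of the partial derivative of H by q."""
    return (H(q + h, p) - H(q - h, p)) / (2 * h)


def dH_dp(q, p, h=1e-6):
    """Central-difference estimate of the partial derivative of H by p."""
    return (H(q, p + h) - H(q, p - h)) / (2 * h)


def integrate(q, p, dt, steps):
    """Advance (q, p) using qdot = dH/dp and pdot = -dH/dq."""
    for _ in range(steps):
        p -= 0.5 * dt * dH_dq(q, p)   # half step for p
        q += dt * dH_dp(q, p)         # full step for q
        p -= 0.5 * dt * dH_dq(q, p)   # second half step for p
    return q, p


q1, p1 = integrate(1.0, 0.0, 1e-3, 1000)   # from q = 1, p = 0 up to t = 1
print("q, p at t = 1:", q1, p1)
print("exact values :", math.cos(1.0), -math.sin(1.0))
print("H at start and end:", H(1.0, 0.0), H(q1, p1))
```

With these initial values the exact solution is q = cos t, p = −sin t, so the printed pairs can be compared directly; H is an integral of the motion and should be reproduced closely at the end of the run.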

An important feature of a variational principle is that its statement is
independent of the co-ordinate system chosen. If another co-ordinate system
is chosen, say, $\bar{q}$-$\bar{p}$, we would have in place of (23),

$$ \delta \int_{t_1}^{t_2} \left[-\bar{H} + \sum \bar{p}\dot{\bar{q}}\right] dt = 0 \qquad (26) $$

and instead of (25)

$$ \frac{\partial \bar{H}}{\partial \bar{q}} = -\dot{\bar{p}}, \quad \frac{\partial \bar{H}}{\partial \bar{p}} = \dot{\bar{q}}. \qquad (27) $$

The equivalence of (23) and (26) for any two terminal conditions suggests
that

$$ \sum p\dot{q} - H(p, q, t) = \sum \bar{p}\dot{\bar{q}} - \bar{H}(\bar{p}, \bar{q}, t) + \frac{dS}{dt}, \qquad (28) $$

where S is some function of q, p, $\bar{q}$, $\bar{p}$. In particular let

$$ S = S(q, \bar{q}, t). \qquad (29) $$

The identity (28) gives

$$ p = \frac{\partial S}{\partial q}, \quad \bar{p} = -\frac{\partial S}{\partial \bar{q}}, \quad \bar{H} = H + \frac{\partial S}{\partial t}. \qquad (30) $$

The transformation from p-q to $\bar{p}$-$\bar{q}$ as given by (30) is called a contact
transformation. 13 The geometrical significance which gives this transformation
its name is that if two hypersurfaces are tangent at a point in the q-p
space their transforms are also tangent at the corresponding point in the
$\bar{q}$-$\bar{p}$ space. In the simple case of two dimensions if two curves touch at a
point their transforms also touch at the corresponding point in the other
space.

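There are three other forms of contact transformation possible according
to the nature of the function S chosen. It is sufficient to indicate here one
other type which is most useful for our applications. Let us have

$$ \sum p\dot{q} - H = \sum \bar{p}\dot{\bar{q}} - \bar{H} + \frac{d}{dt}\left[F(q, \bar{p}, t) - \sum \bar{p}\bar{q}\right] \qquad (31) $$

so that

$$ S = F(q, \bar{p}, t) - \sum \bar{p}\bar{q}. \qquad (32) $$

Expanding the last term of (31) we get

$$ p = \frac{\partial F}{\partial q}, \quad \bar{q} = \frac{\partial F}{\partial \bar{p}}, \quad \bar{H} = H + \frac{\partial F}{\partial t}, \qquad (33) $$

which is another contact transformation between p-q and $\bar{p}$-$\bar{q}$.
Consider the equation

$$ H\left(\frac{\partial F}{\partial q}, q, t\right) + \frac{\partial F}{\partial t} = \bar{H} = 0. \qquad (34) $$

This is the well-known Hamilton-Jacobi differential equation. It can be
constructed for any problem when the total energy is expressed in the
Hamiltonian form, every p being replaced by the corresponding $\partial F/\partial q$. Hamilton's
function, $W = \int L\,dt$, satisfies (34).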


The transformation function F may be so chosen that t is not contained
explicitly in it. In particular, let the system be conservative, the constant
of energy being $\alpha_1$. The contact transformation (33) now leaves H
invariant, F being obtained from the modified equation,

$$ H\left(\frac{\partial F}{\partial q}, q\right) = \alpha_1. \qquad (34') $$

We assume that t does not appear explicitly in H, the system being conservative.
The action function satisfies the Hamilton-Jacobi equation (34').


In either case we proceed further as follows. A complete integral of the
equation (34) or (34') will contain n arbitrary constants if there are n q's, the
additive constant of integration being neglected as it plays no part of physical
significance : as (33) shows only the partial derivatives of F are significant.
Let the n constants of integration be $\alpha_r$. The function $F(q, \alpha, t)$
gives the transformation, in the first case,

$$ p = \frac{\partial F}{\partial q}, \quad \beta = \frac{\partial F}{\partial \alpha}, \quad \bar{H} = 0 \qquad (35) $$

from p-q to $\alpha$-$\beta$. The Hamiltonian equations of the new system are,

$$ \frac{d\alpha}{dt} = 0, \quad \frac{d\beta}{dt} = 0 \qquad (36) $$

so that all the $\alpha$'s and $\beta$'s are constants of the motion. By the conditions
of contact transformation the co-ordinates and the momenta become expressible
as functions of $\alpha$, $\beta$ and t and so the problem is fully solved.


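In the second case the transformation becomes

$$ p = \frac{\partial F}{\partial q}, \quad \beta = \frac{\partial F}{\partial \alpha}, \quad \bar{H} = H = \alpha_1. \qquad (35') $$

The Hamiltonian equations are

$$ \frac{d\beta}{dt} = \frac{\partial \bar{H}}{\partial \alpha}, \quad \frac{d\alpha}{dt} = -\frac{\partial \bar{H}}{\partial \beta}. \qquad (36') $$

As $\bar{H} = \alpha_1$,

$$ \dot{\alpha} = 0, \quad \dot{\beta}_2 = \dot{\beta}_3 = \cdots = 0, \quad \dot{\beta}_1 = 1, \quad \beta_1 = t + \text{const.} $$

The full solution follows from the equations (35'), F being determined by
solving (34').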

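As an illustrative example consider the motion of a particle of unit mass
under gravity. We have

$$ T = \tfrac{1}{2}(\dot{x}^2 + \dot{y}^2), \quad V = gy, $$

and so

$$ \frac{\partial T}{\partial \dot{x}} = p_x, \quad \frac{\partial T}{\partial \dot{y}} = p_y. $$

Hence

$$ H = T + V = \tfrac{1}{2}(p_x^2 + p_y^2) + gy. $$

The Hamilton-Jacobi equation now becomes

$$ \tfrac{1}{2}\left[(\partial F/\partial x)^2 + (\partial F/\partial y)^2\right] + gy + \frac{\partial F}{\partial t} = 0. $$

One solution is

$$ F = \alpha_2 x - \alpha_1 t + \int (2\alpha_1 - \alpha_2^2 - 2gy)^{1/2}\,dy. $$

It may be noticed that $\alpha_1$ is the constant of the energy integral. We have
dropped the additive constant in F since it has no part to play in the
dynamical problem.

$$ \partial F/\partial \alpha_1 = \beta_1, \quad \partial F/\partial \alpha_2 = \beta_2 $$

or

$$ \beta_1 + t = \int \frac{dy}{(2\alpha_1 - \alpha_2^2 - 2gy)^{1/2}}, \quad \beta_2 = x - \alpha_2 \int \frac{dy}{(2\alpha_1 - \alpha_2^2 - 2gy)^{1/2}}. $$

These equations give the time of description and the orbit. To solve this
problem one may also proceed with (34').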
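The solution quoted for the particle under gravity can be verified numerically. In the sketch below the integral in F is written in the closed form −(2α₁ − α₂² − 2gy)^{3/2}/(3g), whose derivative with respect to y is the integrand; the numerical values of g, α₁, α₂, β₁ and β₂ are assumptions chosen only for illustration. The function `orbit` uses the relations obtained by carrying out the two integrals, namely x = β₂ + α₂(t + β₁) and 2gy = 2α₁ − α₂² − g²(t + β₁)².

```python
g = 9.8                  # assumed value of g
A1, A2 = 50.0, 3.0       # illustrative constants alpha_1 and alpha_2


def F(x, y, t):
    """The quoted solution with the y-integral written in closed form."""
    u = 2 * A1 - A2 ** 2 - 2 * g * y
    return A2 * x - A1 * t - u ** 1.5 / (3 * g)


def hj_residual(x, y, t, h=1e-5):
    """Left-hand side of the Hamilton-Jacobi equation, by central differences."""
    Fx = (F(x + h, y, t) - F(x - h, y, t)) / (2 * h)
    Fy = (F(x, y + h, t) - F(x, y - h, t)) / (2 * h)
    Ft = (F(x, y, t + h) - F(x, y, t - h)) / (2 * h)
    return 0.5 * (Fx ** 2 + Fy ** 2) + g * y + Ft


def orbit(t, b1=-1.0, b2=0.0):
    """Position at time t from the two relations for beta_1 and beta_2."""
    x = b2 + A2 * (t + b1)
    y = (2 * A1 - A2 ** 2 - (g * (t + b1)) ** 2) / (2 * g)
    return x, y


residual = hj_residual(0.5, 1.0, 0.2)

# Second difference of y(t) and first difference of x(t) along the orbit.
h = 1e-3
y_accel = (orbit(0.3 + h)[1] - 2 * orbit(0.3)[1] + orbit(0.3 - h)[1]) / h ** 2
x_speed = (orbit(0.3 + h)[0] - orbit(0.3 - h)[0]) / (2 * h)
print("Hamilton-Jacobi residual:", residual)
print("vertical acceleration:", y_accel, " horizontal velocity:", x_speed)
```

The residual should vanish to within the error of the finite differences, and the orbit should show a constant horizontal velocity α₂ together with a vertical acceleration −g, that is, the familiar parabola.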
The special advantage of the Hamiltonian system and the contact
transformation is brought out in the perturbation method. Suppose that the
equation (34) is too complicated to give an exact solution. It may be
possible in such a case to split up H into two parts $H_0$ and R so that

$$ H = H_0 + R, \qquad (37) $$

and

$$ H_0\left(\frac{\partial F}{\partial q}, q, t\right) + \frac{\partial F}{\partial t} = 0 \qquad (38) $$

can be solved. Using the value of F as given by (38) the co-ordinate system
$\alpha$-$\beta$ is obtained and the equations (36) are replaced by

$$ \dot{\alpha} = -\frac{\partial R}{\partial \beta}, \quad \dot{\beta} = \frac{\partial R}{\partial \alpha}, \qquad (39) $$

R being expressed in terms of $\alpha$, $\beta$, t.

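A peculiarity of the Hamiltonian system of equations deserves notice. Let
$\phi(q, p, t)$ be an integral of the system. A variational equation for $\delta q_r$ is

$$ \frac{d}{dt}\,\delta q_r = \sum_s \frac{\partial^2 H}{\partial q_s\,\partial p_r}\,\delta q_s + \sum_s \frac{\partial^2 H}{\partial p_s\,\partial p_r}\,\delta p_s. \qquad (40) $$

This follows straight from the equation, $\dot{q}_r = \partial H/\partial p_r$. If we substitute for
$\delta q_s$ and $\delta p_s$ by means of

$$ \delta q_r = \epsilon\,\frac{\partial \phi}{\partial p_r}, \quad \delta p_r = -\epsilon\,\frac{\partial \phi}{\partial q_r}, \qquad (41) $$

where $\epsilon$ is a small constant,

$$ \frac{d}{dt}\,\delta q_r = \epsilon \sum_s \left(\frac{\partial^2 H}{\partial q_s\,\partial p_r}\frac{\partial \phi}{\partial p_s} - \frac{\partial^2 H}{\partial p_s\,\partial p_r}\frac{\partial \phi}{\partial q_s}\right) $$
$$ = \epsilon \left[\frac{\partial}{\partial p_r}\sum_s \left(\frac{\partial H}{\partial q_s}\frac{\partial \phi}{\partial p_s} - \frac{\partial H}{\partial p_s}\frac{\partial \phi}{\partial q_s}\right) - \sum_s \left(\frac{\partial H}{\partial q_s}\frac{\partial^2 \phi}{\partial p_s\,\partial p_r} - \frac{\partial H}{\partial p_s}\frac{\partial^2 \phi}{\partial q_s\,\partial p_r}\right)\right] $$
$$ = \epsilon \left[\frac{\partial}{\partial p_r}\left(\frac{\partial \phi}{\partial t} - \frac{d\phi}{dt}\right) + \frac{d}{dt}\left(\frac{\partial \phi}{\partial p_r}\right) - \frac{\partial^2 \phi}{\partial t\,\partial p_r}\right] $$
$$ = \epsilon\,\frac{d}{dt}\left(\frac{\partial \phi}{\partial p_r}\right), \qquad (42) $$

since $d\phi/dt$ vanishes on account of $\phi$ being an integral. Thus variational
equations for $\delta q$ and $\delta p$ are satisfied by (41). Now the transformation from
$q_r$-$p_r$ to $\bar{q}_r$-$\bar{p}_r$ where

$$ \bar{q}_r = q_r + \delta q_r, \quad \bar{p}_r = p_r + \delta p_r \qquad (43) $$

is a contact transformation when a function W(q, p) exists such that

$$ \sum (\bar{p}_r\,d\bar{q}_r - p_r\,dq_r) = dW \qquad (44) $$

or

$$ \sum \{(p_r + \delta p_r)(dq_r + d\,\delta q_r) - p_r\,dq_r\} = dW $$

or

$$ \sum (dq_r\,\delta p_r + p_r\,d\,\delta q_r) = dW, \qquad (45) $$

neglecting the term $\delta p_r\,d\,\delta q_r$ of the higher order. Let

$$ \delta p_r = \epsilon P_r, \quad \delta q_r = \epsilon Q_r. \qquad (46) $$

If $W = \epsilon\omega$ and $\epsilon$ is a small constant,

$$ \sum (dq_r\,P_r + p_r\,dQ_r) = d\omega \qquad (47) $$

or

$$ \sum (P_r\,dq_r - Q_r\,dp_r) = d\left(\omega - \sum p_r Q_r\right), \qquad (48) $$

so that

$$ P_r = \frac{\partial}{\partial q_r}\left(\omega - \sum_s p_s Q_s\right), \quad Q_r = -\frac{\partial}{\partial p_r}\left(\omega - \sum_s p_s Q_s\right). \qquad (49) $$

By comparing (41) with (49) we find that (41) is a contact transformation.
All infinitesimal contact transformations which preserve the physical
content of the system, that is, which transform the family of orbits
into itself are of the type (41). On the other hand, the transformation which
replaces $H_0$ by $H_0 + R = H$ even where R is an infinitesimal is not in
general of the type (41). Hence the dynamical system characterised by
$H_0$ is different from that characterised by H. It is the possibility of going
from one dynamical system to another, through a contact transformation,
which enhances the value of the latter.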

As a dynamical system progresses the values of the variables $q_r$, $p_r$ at 
time t are replaced by 

$$q_r + \frac{\partial H}{\partial p_r}\,\delta t, \qquad p_r - \frac{\partial H}{\partial q_r}\,\delta t \tag{50}$$

at $t + \delta t$. H being an integral of the system the motion may be interpreted 
as a progressive transition from one set of variables to another through 
infinitesimal contact transformations. The same fact is expressed alternatively 
when we say that Hamilton's function $\int_{t_0}^{t} L\,dt$ is a solution of the 
Hamilton-Jacobi differential equation. 


4. POISSON BRACKETS 

The ‘Poisson brackets 1 * play a very conspicuous part in classical as well 
as quantum mechanics. The non-commutative relations of the latter theory 
were deduced by Dirac from the Poisson brackets expressed in action-angle 
variables. The same relations, which had appeared first in a paper by 
Heisenberg, were suggested by the matrix formulation of observables. As 
the classical Poisson brackets of a dynamical system satisfy conditions which 
are very similar in form to the quantum conditions some important properties 
of these classical expressions will be given here. 
Consider a dynamical system characterised by the n co-ordinates $q_r$ 
and the n momenta $p_r$, r running from 1 to n. If u, v are any two functions 
of the co-ordinates and the momenta, the Poisson bracket [u, v] is given 
by 

$$[u, v] = \sum_r \left( \frac{\partial u}{\partial q_r}\,\frac{\partial v}{\partial p_r} - \frac{\partial u}{\partial p_r}\,\frac{\partial v}{\partial q_r} \right). \tag{51}$$

Hence $[u, v] = -[v, u]$ and $[u, u] = 0$, 
and it can be easily verified that if w is another function of q's and p's 

$$[[u, v], w] + [[v, w], u] + [[w, u], v] = 0. \tag{52}$$

This is a well known identity due to Jacobi. Directly from the definition 
of the Poisson bracket the following obvious relations follow: 

$$[q_r, p_r] = 1, \quad [q_r, p_s] = 0, \quad [p_r, p_s] = 0, \quad [q_r, q_s] = 0, \qquad r, s = 1, 2, 3, \ldots, n; \; r \neq s. \tag{53}$$

The Hamiltonian equations of motion are equivalent to 

$$\dot q = [q, H], \qquad \dot p = [p, H], \tag{54}$$

where, as usual, H denotes the Hamiltonian. Further, if the variables are 
changed by a contact transformation so that the equations of motion now 
become 

$$\dot{\bar q} = [\bar q, H], \qquad \dot{\bar p} = [\bar p, H], \tag{55}$$

the two sets of variables are connected by the relations, 

$$[\bar q_r, \bar p_r] = 1, \quad [\bar q_r, \bar p_s] = 0, \quad [\bar p_r, \bar p_s] = 0, \quad [\bar q_r, \bar q_s] = 0, \qquad r, s = 1, 2, \ldots, n; \; r \neq s. \tag{56}$$

It may be emphasized that (53) are obvious identities while (56) are 
equations connecting $q$-$p$ and $\bar q$-$\bar p$. For establishing (56) it is enough to 
show that for an infinitesimal contact transformation, viz., 

$$\bar q_r = q_r + \epsilon\,\frac{\partial \phi}{\partial p_r}, \qquad \bar p_r = p_r - \epsilon\,\frac{\partial \phi}{\partial q_r}, \tag{57}$$

where $\phi$ is any function of $p$-$q$, the relations are valid if $\epsilon^2$ is negligible. 
Inasmuch as every contact transformation may be supposed to be generated 
by successive infinitesimal contact transformations the relations (56) become 
valid generally. The usual proof of (56) is based on the notion of Lagrange 
brackets. 

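An immediate result of (56) is the important property of Poisson brackets 
usually stated explicitly as under: 

$$[u, v] = \sum_r \left( \frac{\partial u}{\partial \bar q_r}\,\frac{\partial v}{\partial \bar p_r} - \frac{\partial u}{\partial \bar p_r}\,\frac{\partial v}{\partial \bar q_r} \right), \tag{58}$$

which can be directly verified by using the values of the $n^2$ brackets (56). 
In general 

$$\frac{du}{dt} = \frac{\partial u}{\partial t} + \sum_r \left( \frac{\partial u}{\partial q_r}\,\dot q_r + \frac{\partial u}{\partial p_r}\,\dot p_r \right) = \frac{\partial u}{\partial t} + [u, H]. \tag{59}$$

Hence if t does not appear explicitly in u the condition for u to be an integral 
of the system is 

$$[u, H] = 0. \tag{60}$$

By using the identity (52), [u, v] may be shown to be an integral of the 
system if u and v are any two independent integrals. 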
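
The bracket properties quoted above can be checked numerically. The sketch below is only an illustration and is not part of the original lectures: it approximates the partial derivatives in (51) by central differences, and the test system (a plane isotropic oscillator of unit mass and stiffness), the function names and the sample phase-space point are all assumptions chosen for convenience.

```python
def poisson_bracket(u, v, q, p, h=1e-5):
    """Approximate [u, v] of (51) at the point (q, p) by central differences."""
    def partial(f, which, r):
        qa, pa, qb, pb = list(q), list(p), list(q), list(p)
        if which == "q":
            qa[r] += h
            qb[r] -= h
        else:
            pa[r] += h
            pb[r] -= h
        return (f(qa, pa) - f(qb, pb)) / (2.0 * h)

    total = 0.0
    for r in range(len(q)):
        total += partial(u, "q", r) * partial(v, "p", r)
        total -= partial(u, "p", r) * partial(v, "q", r)
    return total


# Hypothetical test system: isotropic oscillator in a plane.
def hamiltonian(q, p):
    return 0.5 * (p[0] ** 2 + p[1] ** 2) + 0.5 * (q[0] ** 2 + q[1] ** 2)


def angular_momentum(q, p):
    return q[0] * p[1] - q[1] * p[0]


def q1(q, p):
    return q[0]


def p1(q, p):
    return p[0]


def p2(q, p):
    return p[1]


point_q, point_p = [0.3, -0.7], [1.1, 0.4]

# Fundamental brackets (53): [q1, p1] is close to 1, [q1, p2] close to 0.
print(poisson_bracket(q1, p1, point_q, point_p))
print(poisson_bracket(q1, p2, point_q, point_p))
# Condition (60): the angular momentum is an integral, so [L, H] is close to 0.
print(poisson_bracket(angular_momentum, hamiltonian, point_q, point_p))
```

Because every function used here is at most quadratic, the central differences are exact up to rounding, so the printed values differ from 1 and 0 only by round-off.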

Let us now expand a Poisson bracket of the form [xy, zw] without 
disturbing the order of x relative to y and of z relative to w. Two expansions¹⁴ 
become possible. 

$$[xy, zw] = x\,[y, zw] + [x, zw]\,y = x\,\{[y, z]\,w + z\,[y, w]\} + \{[x, z]\,w + z\,[x, w]\}\,y. \tag{61}$$

Similarly 

$$[xy, zw] = [xy, z]\,w + z\,[xy, w] = \{x\,[y, z] + [x, z]\,y\}\,w + z\,\{x\,[y, w] + [x, w]\,y\}. \tag{62}$$

Comparing the two expansions we get 

$$xz\,[y, w] - zx\,[y, w] = [x, z]\,yw - [x, z]\,wy,$$

$$(xz - zx)\,[y, w] = [x, z]\,(yw - wy). \tag{63}$$

In classical mechanics xz = zx, yw = wy but in quantum mechanics where 
the variables have a different significance this is not always true. By a 
postulate of Dirac's theory, 

$$xz - zx = \frac{ih}{2\pi}\,[x, z], \qquad yw - wy = \frac{ih}{2\pi}\,[y, w], \tag{64}$$

h being Planck's constant and i, a square root of -1. The trivial relations (53) 
now become the quantum conditions of Dirac's theory: 

$$q_r p_r - p_r q_r = \frac{ih}{2\pi}, \quad q_r p_s - p_s q_r = 0, \quad p_r p_s - p_s p_r = 0, \quad q_r q_s - q_s q_r = 0, \qquad r, s = 1, 2, \ldots, n; \; r \neq s. \tag{65}$$

It is worthwhile noting that (63) becomes an identity also when the values 
of Poisson brackets as given by (64) are substituted therein. These (65) 
are also known as exchange relations. 
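
The two order-preserving expansions can be tried on non-commuting quantities. The following sketch is an illustration only: it takes the bracket to be the plain commutator ab - ba (the constant factor in (64) is dropped, which does not affect the check), and the four 2 x 2 integer matrices are arbitrary assumed test values.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(*terms):
    n = len(terms[0])
    return [[sum(t[i][j] for t in terms) for j in range(n)] for i in range(n)]


def sub(a, b):
    n = len(a)
    return [[a[i][j] - b[i][j] for j in range(n)] for i in range(n)]


def comm(a, b):
    """Commutator ab - ba, standing in for the bracket up to a constant."""
    return sub(matmul(a, b), matmul(b, a))


def prod(*factors):
    result = factors[0]
    for f in factors[1:]:
        result = matmul(result, f)
    return result


# Arbitrary non-commuting test matrices (assumed values).
x = [[1, 2], [0, 1]]
y = [[0, 1], [1, 3]]
z = [[2, 0], [1, 1]]
w = [[1, 1], [4, 0]]

direct = comm(matmul(x, y), matmul(z, w))

# Expansion (61): x[y,z]w + xz[y,w] + [x,z]wy + z[x,w]y
expansion_61 = add(prod(x, comm(y, z), w), prod(x, z, comm(y, w)),
                   prod(comm(x, z), w, y), prod(z, comm(x, w), y))

# Expansion (62): x[y,z]w + [x,z]yw + zx[y,w] + z[x,w]y
expansion_62 = add(prod(x, comm(y, z), w), prod(comm(x, z), y, w),
                   prod(z, x, comm(y, w)), prod(z, comm(x, w), y))

print(direct == expansion_61, direct == expansion_62)
```

Integer entries keep the arithmetic exact, so the comparison is an equality test and not a tolerance test.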


5. CONDITIONALLY PERIODIC SYSTEMS 


Consider a conservative dynamical system for which it is possible to 
use co-ordinates splitting the Hamilton-Jacobi equation given by the energy 
integral into a series of equations of the form¹⁵ 

$$\left( \frac{\partial F}{\partial q_i} \right)^2 = f_i\,(q_i;\; \alpha_1, \alpha_2, \ldots, \alpha_n). \tag{66}$$

We have in such a case 

$$F = \sum_i \int (f_i)^{1/2}\,dq_i. \tag{67}$$

Further let us suppose that when $f_i$ does not reduce to a constant there 
exist two successive roots $q_i'$ and $q_i''$ between which $q_i$ must lie if $\partial F/\partial q_i$ is 
to be real. In the latter case there is a libration in the $q_i$ co-ordinate between 
$q_i'$ and $q_i''$. We define for a complete libration in the $q_i$ co-ordinate the integral 
$I_i$ as given by 

$$I_i = \oint p_i\,dq_i = \oint (f_i)^{1/2}\,dq_i. \tag{68}$$

During a complete libration of $q_i$, F increases by $I_i$ which is therefore called 
a modulus of periodicity of F. It is clear that each I is a function of α's 
since $q_i'$ and $q_i''$, the limits of integration, are roots of 

$$f_i\,(q_i, \alpha_1, \alpha_2, \ldots, \alpha_n) = 0.$$

Let us suppose that there are as many independent I's as there are α's. F can 
now be expressed as a function of q's and I's. If we denote $\partial F/\partial I$ by 
W there is the contact transformation 

$$p = \frac{\partial F}{\partial q}, \qquad W = \frac{\partial F}{\partial I}, \qquad \bar H = H, \tag{69}$$

$\partial F/\partial t$ being zero. The new Hamiltonian equations are 

$$\dot I = -\frac{\partial H}{\partial W}, \qquad \dot W = \frac{\partial H}{\partial I}. \tag{70}$$

When the energy integral $H = \alpha_1$ exists H is expressible as a function of I's 
only. Therefore, 

$$\dot I = 0 \tag{71}$$

and I is a constant. Moreover $\partial H/\partial I$, being a function of I's only, is 
constant for the motion. So 

$$\dot W = \omega, \qquad W = \omega t + s, \tag{72}$$

the constants ω and s being defined by the equations themselves. The 
variables I-W are the action-angle variables of the classical theory and ω's are 
frequencies. 

The relations between I-W and q are best brought out by showing the 
effect of a libration in $q_i$ on $W_k$. For 

$$\Delta W_k = \oint \frac{\partial W_k}{\partial q_i}\,dq_i = \oint \frac{\partial^2 F}{\partial I_k\,\partial q_i}\,dq_i = \frac{\partial}{\partial I_k} \oint \frac{\partial F}{\partial q_i}\,dq_i = \frac{\partial I_i}{\partial I_k}, \tag{73}$$

that is, the effect on $W_k$ is zero or an increase by unity according as the 
co-ordinate of libration is other than $q_k$ or $q_k$ itself. Hence the q's are 
functions of I-W, being periodic in each W with period unity. Thus 

$$q = \sum_{\tau_1} \cdots \sum_{\tau_n} C_{\tau_1 \ldots \tau_n}\, e^{2\pi i\,(\tau_1 \omega_1 + \tau_2 \omega_2 + \cdots + \tau_n \omega_n)\,t}, \tag{74}$$

where $\tau_1, \tau_2, \ldots, \tau_n$ are integers or zero and the coefficients $C_{\tau_1 \ldots \tau_n}$ 
are functions of $I_1, \ldots, I_n$. If there exists a set of integers for which 

$$\tau_1 \omega_1 + \tau_2 \omega_2 + \cdots + \tau_n \omega_n = 0 \tag{75}$$

the system is said to be degenerate. It may be singly or multiply degenerate 
according to the number of distinct relations like (75) which connect the 
frequencies. A non-degenerate system evidently will not pass through the 
same configuration twice but it can be brought as near as one likes to any 
configuration of the past by choosing a sufficiently distant epoch. Hence a 
non-degenerate system is called conditionally periodic. For a doubly periodic 
system, for example, let 

$$q = \sum_{\tau_1} \sum_{\tau_2} C_{\tau_1 \tau_2}\, e^{2\pi i\,(\tau_1 \omega_1 + \tau_2 \omega_2)\,t},$$

where $\omega_1 = \pi\,\omega_2$. When t = 0, 

$$q = q_0 = \sum_{\tau_1} \sum_{\tau_2} C_{\tau_1 \tau_2}.$$

For no other value of t, $q = q_0$. But if we take $\omega_1 = 3\,\omega_2$, approximately, 
q is nearly equal to $q_0$ when $t = 1/\omega_2$. Similarly if we take $\omega_1 = 3.1\,\omega_2$ 
as a better approximation, we have to consider the epoch $t = 10/\omega_2 = 31/\omega_1$ 
for getting q nearly equal to $q_0$. 

A relation of great interest in Bohr's theory may now be obtained. The 
Hamiltonian of a conservative system is equal to $\alpha_1$, and hence a function 
of I's only as already stated above. Hence by (70) and (72), if the 
variation in energy is $\delta E$, 

$$\delta E = \sum \omega\,\delta I. \tag{76}$$

By the conditions of the contact transformation (69) 

$$\sum p\,\delta q + \sum W\,\delta I = \delta F$$

or, 

$$\sum p\,\delta q - \sum I\,\delta W = \delta\Big( F - \sum I W \Big) = \delta F'. \tag{77}$$

For a complete libration in any of the q's there is an increase of the 
corresponding I in F as also in the function $\sum I W$ as (68) and (73) show. 
Hence F' remains unchanged in a complete libration. This fact is used to 
prove that the mean kinetic energy over a long interval of time is $\tfrac{1}{2} \sum I \omega$. 
For the required mean over a long period of time $t_0$ is, by (70) 

$$\frac{1}{2 t_0} \int_0^{t_0} \sum p\,\dot q\,dt = \frac{1}{2 t_0} \int_0^{t_0} \Big( \sum I\,\dot W\,dt + dF' \Big) = \tfrac{1}{2} \sum I \omega + \frac{1}{2 t_0} \Big[ F' \Big]_0^{t_0}; \tag{78}$$

$[F']_0^{t_0}$ is bound to remain finite because of what is known of F'. As $t_0$ 
increases the average value of the kinetic energy over a long period is found 
to be very nearly $\tfrac{1}{2} \sum I \omega$. It is only in the limit as $t_0 \to \infty$ that this value 
is actually attained. 

The action functions I play a very important part in Bohr's theory. 
This is due to the following important theorem leading to Ehrenfest's¹⁶ 
adiabatic principle: 

“The I-values are the same for two dynamical systems which are 
distinguished only by different values of a parameter in the Hamiltonians.” 

The theorem is not true for degenerate systems. 

To prove the theorem consider the Hamiltonian H (q, p, c) in which c 
is the parameter varying with time. The contact transformation (77) gives 

$$\frac{\partial F'}{\partial q} = p, \qquad \frac{\partial F'}{\partial W} = -I, \qquad \bar H = H + \frac{\partial F'}{\partial t}, \tag{79}$$

F' containing t explicitly in this case. There follows the equation 

$$\frac{dI}{dt} = -\frac{\partial \bar H}{\partial W} = -\frac{\partial H}{\partial W} - \frac{\partial^2 F'}{\partial W\,\partial t}. \tag{80}$$

W does not appear in H in the case when c is a constant. The effect of 
a varying c is only to make the I's functions of t. Therefore, 

$$\frac{\partial H}{\partial W} = 0 \tag{81}$$

and 


SF' 
dt = 

SWdt 


dF . * . v 2 j«' fxi <i>i + +,% w.)/ 

&W* tS V (82) 

» cSB t e 2 ™ ( Tl Wl + + t, «■>») I 


Hence 

3 


t 


(83) 
there being no constant term, as a result of differentiation, in the right-hand 
side of (83). In writing the expansion of F' we have used the earlier 
result that F' is a multiply periodic function when c is a constant. We 
expect the coefficients $A_{\tau_1 \ldots \tau_n}$ to involve t through c. The ω's also will involve 
t through c. In spite of these complications it is obvious from (80), (81) 
and (83) that 



k&F _ 

a z dt - ~ Cm CL 

t\ dwdt 


(84) 


where we take $c_m$ as the mean value, a being a finite multiplying factor. 
Hence by making $c_m$ conceptually small enough we get 


['!*»• « 
The /-values are therefore invariant for a change in the value of the para- 
meter c. 

By way of illustration an oscillator may be considered, given by 

$$\ddot x + \mu x = 0.$$

When $\mu = \mu_0$ let $x = a_0 \cos(p_0 t + \epsilon_0)$, 

where $\mu_0 = p_0^2$. 

When $\mu = \mu_1$ let $x = a_1 \cos(p_1 t + \epsilon_1)$, 

where $\mu_1 = p_1^2$. 

Let us suppose that the transition from $\mu = \mu_0$ to $\mu = \mu_1$ takes place very 
slowly and that during the transition 

$$\mu = \mu(t), \quad p = p(t), \quad a = a(t), \quad \epsilon = \epsilon(t).$$

Substituting $x = a \cos(pt + \epsilon)$ 

in the differential equation we find 

[«-« + «) ] j = — 

or*' + 

Hence if a, p, ε are very slowly varying functions of time we get what is 
analogous to (85) 

$$a^2 p = C, \quad \text{or} \quad a_0^2\,p_0 = a_1^2\,p_1 = C. \tag{85'}$$

It is this result, as will be shown later, which makes Planck's hypothesis so 
successful and earns for it a central position in the logical development of 
the quantum theory. 
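
The constancy of $a^2 p$ under a slow change of μ can be seen in a direct numerical integration. The sketch below is an illustration under stated assumptions and not the author's derivation: the mass is taken as unity, μ is raised linearly from 1 to 4 over a ramp time that is long compared with the period, and a velocity-Verlet step is used. Since the energy is $\tfrac{1}{2} a^2 p^2$, the quantity $a^2 p$ is computed as $2E/p$.

```python
import math


def slow_ramp_invariant(mu0=1.0, mu1=4.0, ramp_time=500.0, dt=0.01):
    """Integrate x'' + mu(t) x = 0 while mu rises slowly from mu0 to mu1.

    Returns (E_start, E_end, a2p_start, a2p_end), with a^2 p = 2E/p, p = sqrt(mu).
    """
    def mu(t):
        return mu0 + (mu1 - mu0) * min(t / ramp_time, 1.0)

    def energy(x, v, t):
        return 0.5 * v * v + 0.5 * mu(t) * x * x

    x, v, t = 1.0, 0.0, 0.0
    e_start = energy(x, v, t)
    for _ in range(int(round(ramp_time / dt))):
        v_half = v - 0.5 * dt * mu(t) * x   # half kick with the old stiffness
        x += dt * v_half                    # drift
        t += dt
        v = v_half - 0.5 * dt * mu(t) * x   # half kick with the new stiffness
    e_end = energy(x, v, t)
    return (e_start, e_end,
            2.0 * e_start / math.sqrt(mu0),
            2.0 * e_end / math.sqrt(mu1))


e0, e1, inv0, inv1 = slow_ramp_invariant()
print("energy ratio:", e1 / e0)       # close to p1/p0 = 2
print("a^2 p ratio :", inv1 / inv0)   # close to 1
```

The energy roughly doubles while $a^2 p$ changes only by a fraction of a per cent; shortening `ramp_time` makes the transition less slow and the invariance correspondingly poorer.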
6. DEGENERACY 


Let the frequencies of a multiply periodic system¹⁵ be connected by the 
relation of degeneracy, 

$$\tau_1 \omega_1 + \tau_2 \omega_2 + \cdots + \tau_n \omega_n = 0. \tag{86}$$

This is equivalent to 

$$\tau_1 \frac{\partial H}{\partial I_1} + \tau_2 \frac{\partial H}{\partial I_2} + \cdots + \tau_n \frac{\partial H}{\partial I_n} = 0. \tag{87}$$

The general integral of the partial differential equation (87) is 

$$H = f\,(J_1, J_2, \ldots, J_{n-1}), \tag{88}$$

where 

$$J_1 = \frac{I_1}{\tau_1} - \frac{I_n}{\tau_n}, \quad \text{etc.} \tag{89}$$

It will be remembered that $\tau_1$, $\tau_2$, etc., are all integers. The J's are the 
new independent action variables. Their number is one less than that of the 
I's. When the number of independent action variables is less than that of 
co-ordinates the system is degenerate. Each condition of degeneracy reduces 
the number of independent action functions by one. The angle variables 
are given by 

$$v_i = \frac{\partial F}{\partial J_i}, \tag{90}$$

where F is the function of the contact transformation. The connections 
between W's and v's are given by 

$$W_1 = \frac{\partial F}{\partial I_1} = \frac{\partial F}{\partial J_1}\frac{\partial J_1}{\partial I_1} + \frac{\partial F}{\partial J_2}\frac{\partial J_2}{\partial I_1} + \cdots + \frac{\partial F}{\partial J_{n-1}}\frac{\partial J_{n-1}}{\partial I_1} = \frac{v_1}{\tau_1}. \tag{91}$$

Similarly, 

$$W_2 = \frac{v_2}{\tau_2}, \ \text{etc.}; \qquad W_{n-1} = \frac{v_{n-1}}{\tau_{n-1}};$$

$$W_n = \frac{\partial F}{\partial I_n} = -\,\frac{v_1 + v_2 + \cdots + v_{n-1}}{\tau_n}. \tag{92}$$

The co-ordinates and the momenta of the system can be expressed in terms 
of J's and v's by using the above results. 

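Let us work out fully a simple degenerate case. 

For an electron moving in the field of a proton, 

$$T = \tfrac{1}{2}\,m\,(\dot r^2 + r^2 \dot\theta^2), \qquad V = -\frac{e^2}{r}$$

in the usual notation. As 

$$p_r = m \dot r, \qquad p_\theta = m r^2 \dot\theta,$$

the Hamiltonian is 

$$H = \frac{1}{2m} \left( p_r^2 + \frac{p_\theta^2}{r^2} \right) - \frac{e^2}{r}.$$

The Hamilton-Jacobi equation is 

$$\frac{1}{2m} \left[ \left( \frac{\partial F}{\partial r} \right)^2 + \frac{1}{r^2} \left( \frac{\partial F}{\partial \theta} \right)^2 \right] - \frac{e^2}{r} = \alpha_1.$$

Hence 

$$\frac{\partial F}{\partial \theta} = \alpha_2, \qquad \frac{\partial F}{\partial r} = \frac{1}{r}\,\{ A\,(r_1 - r)(r - r_2) \}^{1/2},$$

where 

$$r_1 + r_2 = -\frac{e^2}{\alpha_1}, \qquad r_1 r_2 = -\frac{\alpha_2^2}{2 m \alpha_1}, \qquad A = -2 m \alpha_1,$$

and 

$$I_1 = \oint \frac{\partial F}{\partial r}\,dr = \oint \{ A\,(r_1 - r)(r - r_2) \}^{1/2}\,\frac{dr}{r} = 2\pi \left( \frac{m e^2}{(-2 m \alpha_1)^{1/2}} - \alpha_2 \right),$$

$$I_2 = \oint p_\theta\,d\theta = 2\pi \alpha_2,$$

$$\alpha_1 = H = -\frac{2\pi^2 m e^4}{(I_1 + I_2)^2}.$$

The total energy is expressed as a function of $I_1$ and $I_2$ only. The frequencies 
are given by 

$$\omega_1 = \frac{\partial H}{\partial I_1}, \qquad \omega_2 = \frac{\partial H}{\partial I_2}.$$

As $\omega_1 - \omega_2 = 0$ the dynamical system is degenerate. Obviously we can 
write 

$$H = -\frac{2\pi^2 m e^4}{J_1^2}$$

if we take 

$$J_1 = I_1 + I_2.$$

The angle variable corresponding to $J_1$ is given by 

$$W_1 = \frac{\partial F}{\partial I_1} = \frac{\partial F}{\partial J_1}\,\frac{\partial J_1}{\partial I_1} = v_1.$$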

7. RELATIVISTIC EXPRESSIONS 

The vector¹⁷ associated with the motion of a particle in special relativity is 

$$mc\,\frac{dx}{ds}, \quad mc\,\frac{dy}{ds}, \quad mc\,\frac{dz}{ds}, \quad i\,mc^2\,\frac{dt}{ds}, \tag{93}$$

where m is the rest mass of the particle, c the velocity of light, $i^2 = -1$ and 

$$ds^2 = c^2\,dt^2 - dx^2 - dy^2 - dz^2. \tag{94}$$

Hence the components of the vector are, if $v^2 = \dot x^2 + \dot y^2 + \dot z^2$, 

$$\frac{m \dot x}{(1 - v^2/c^2)^{1/2}}, \quad \frac{m \dot y}{(1 - v^2/c^2)^{1/2}}, \quad \frac{m \dot z}{(1 - v^2/c^2)^{1/2}}, \quad \frac{i\,mc}{(1 - v^2/c^2)^{1/2}}, \tag{95}$$

or $p_1$, $p_2$, $p_3$, $iE/c$, 

where $p_1$, $p_2$, $p_3$ is the momentum vector and E is the energy. 

$$p_1^2 + p_2^2 + p_3^2 - \frac{E^2}{c^2} = -m^2 c^2, \tag{96}$$

$$E = \frac{mc^2}{(1 - v^2/c^2)^{1/2}}. \tag{97}$$

For a photon or a quantum of radiation v = c and its energy can be finite 
only if the rest mass m vanishes. The condition satisfied by a photon is 

$$E^2 = c^2\,(p_1^2 + p_2^2 + p_3^2). \tag{98}$$

The Lagrangian L = T - V must be such that, for a particle, 

$$\frac{\partial L}{\partial \dot x} = p_1, \qquad \frac{\partial L}{\partial \dot y} = p_2, \qquad \frac{\partial L}{\partial \dot z} = p_3. \tag{99}$$

This is achieved by taking 

$$L = mc^2 \left[ 1 - (1 - v^2/c^2)^{1/2} \right] - V. \tag{100}$$

It is worth pointing out that the expression which replaces the classical 
kinetic energy in L is different from the relativistic expression of the kinetic 
energy. This latter is given by 

$$T = mc^2 \left[ (1 - v^2/c^2)^{-1/2} - 1 \right]. \tag{101}$$

For 

$$\frac{d}{dt}\left( \frac{\partial L}{\partial \dot x} \right) = -\frac{\partial V}{\partial x}, \qquad \frac{d}{dt}\left( \frac{\partial L}{\partial \dot y} \right) = -\frac{\partial V}{\partial y}, \qquad \frac{d}{dt}\left( \frac{\partial L}{\partial \dot z} \right) = -\frac{\partial V}{\partial z}.$$

This gives 

$$\frac{d}{dt}\left( \frac{\partial L}{\partial \dot x} \right) \dot x + \frac{d}{dt}\left( \frac{\partial L}{\partial \dot y} \right) \dot y + \frac{d}{dt}\left( \frac{\partial L}{\partial \dot z} \right) \dot z + \frac{dV}{dt} = 0.$$

The resulting integral is 

$$\frac{mc^2}{(1 - v^2/c^2)^{1/2}} + V = \text{const.} \tag{102}$$

The kinetic energy is defined as in (101) firstly because it agrees with the 
expression of energy as in (102) and secondly for the reason that as c → ∞ the 
classical expression of kinetic energy is realised from (101). 
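
The distinction between the kinetic term of the Lagrangian (100) and the kinetic energy (101), and their common classical limit, can be seen in a few lines. The sketch below is illustrative only; the function names and the choice of units with m = c = 1 are assumptions.

```python
import math


def kinetic_energy(m, v, c):
    """Relativistic kinetic energy (101): m c^2 [(1 - v^2/c^2)^(-1/2) - 1]."""
    return m * c * c * (1.0 / math.sqrt(1.0 - (v / c) ** 2) - 1.0)


def lagrangian_kinetic_term(m, v, c):
    """Kinetic part of the Lagrangian (100): m c^2 [1 - (1 - v^2/c^2)^(1/2)]."""
    return m * c * c * (1.0 - math.sqrt(1.0 - (v / c) ** 2))


def classical_kinetic_energy(m, v):
    return 0.5 * m * v * v


m, c = 1.0, 1.0
for v in (0.01, 0.1, 0.8):
    print(v,
          kinetic_energy(m, v, c),
          lagrangian_kinetic_term(m, v, c),
          classical_kinetic_energy(m, v))
```

For v much smaller than c the three columns agree closely; at v = 0.8c they are clearly different, with (101) the largest and the classical value the smallest.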


8. THE FAILURE OF CLASSICAL MECHANICS 

The successes of the theory need not be recorded here. They are found 
in practically every branch of physical science although the most striking 
and sensational of the successes are claimed by astronomers who have made 
accurate predictions of comets and planets. Discussing the failure of the 
theory we will not be concerned with astronomical situations such as the 
motion of the perihelion of Mercury as that will entail a digression into 
general relativity. About the beginning of the present century three major 
problems of atomic physics had defeated all attempts of solution on classical 
lines. They may be briefly mentioned here. Firstly, the atoms and 
molecules must be very stable structures if substances are to possess definite 
physical and chemical properties; further, their internal structure must 
consist of charges in motion as otherwise their characteristic radiations cannot 
be accounted for. No model of the atom could reconcile the stability, which 
was taken for granted in the kinetic theory of gases, with the motion of the 
inner charges as given by classical electrodynamics and mechanics. Secondly, 
it was a mere matter of faith that the statistical laws which a complicated 
system such as a gas obeys could be deduced from classical laws. This 
deduction was never accomplished and the view was gaining ground that the 
laws of large numbers in physics remain outside the ambit of classical 
mechanics. The third and the most discussed problem was that of heat radiation. 
An account of it is given in a discussion of the quantum hypothesis which 
removed the difficulties of this problem and explained a host of phenomena 
linked up with it such as that of unaccountably low specific heats at low 
temperatures. 

REFERENCES 

1. Thomson and Tait, Natural Philosophy, Vol. I, Part 1, pp. 240-9 (1879). 

2. Mach, E., Science of Mechanics, p. 340 (1893). 

3. Thomson and Tait, Loc. Cit., Vol. I, Part 2, p. 9 (1883). 

4. Weyl, H., Mind and Nature, pp. 60-80 (1934). 

5a. Osgood, W., Mechanics, pp. 345-55 (1937). 

5b. Pearson, K., The Grammar of Science, pp. 235-75 (1937). 

6. Hartree, D. R., Mathematical Gazette, Vol. 19, pp. 73-8 (1935). 

7. Mises, R. von, Probability, Statistics and Truth, pp. 304-7 (1939). 

8. Russell, B., The Principles of Mathematics, p. 348 and pp. 482-8 (1937). 

9. Schrödinger, E., Science and the Human Temperament, pp. 43-65 (1935). 

10. Condon and Shortley, The Theory of Atomic Spectra, p. 429 (1935). 

11a. Osgood, W., Loc. Cit., pp. 356-88 (1937). 

11b. Maxwell, J. C., Matter and Motion, pp. 145-61 (1920). 

12a. Whittaker, E. T., Analytical Dynamics, pp. 288-355 (1927). 

12b. Tolman, R. C., Principles of Statistical Mechanics, pp. 16-42 (1938). 

13. Eisenhart, L. P., Continuous Groups of Transformations, p. 239 (1933). 

14. Dirac, P. A. M., Quantum Mechanics, pp. 88-91 (1935). 

15. Birtwistle, G., The Quantum Theory of the Atom, pp. 56-79 (1926). 

16a. Jeans, J. H., The Dynamical Theory of Gases, pp. 406-15 (1921). 

16b. Ruark and Urey, Atoms, Molecules and Quanta, pp. 176-9 (1930). 

17a. Birtwistle, G., Loc. Cit., pp. 48-52 (1926). 

17b. Frenkel, J., Wave Mechanics, Part 1, pp. 9-14 (1936). 



II THE QUANTUM HYPOTHESIS 

1. The Origin of the Quantum Hypothesis. 2. Einstein’s Relations. 

3. The Success of the Quantum Hypothesis. 

1. THE ORIGIN OF THE QUANTUM HYPOTHESIS 

Atoms have been talked of since the days of Democritus and Lucretius 
but the modern theory of discontinuous matter owes its origin to Dalton. 
Planck and Einstein¹ are jointly responsible for the quantum hypothesis which 
asserts the discontinuity of energy. The work of a series of investigators 
starting from Faraday's laws of electrolysis has culminated in the postulate 
of the electron which implies the discontinuity of electric charge. On a 
survey of the historical development the quantum hypothesis does not appear 
to-day as anything bizarre or revolutionary; on the contrary it appears as 
a logical sequel to the earlier hypothesis of Dalton and in keeping with the 
speculations that were current in 1900 about the atomicity of charge. 

The physical situation which first suggested a mathematical formulation 
of the quantum postulate originated in the topic of the temperature radiation 
of a black body. 

A black body is an ideal body which absorbs all incident radiation there 
being no reflection, scattering or transmission. Lamp black approaches this 
ideal; so also a hollow metal globe with a small hole in it as the chance of 
escape for any radiation entering it is very small. In general, a body which 
is kept at a given temperature may emit a characteristic radiation without 
undergoing a structural change. This characteristic radiation is a pure 
temperature radiation since it depends upon the temperature alone. Theoretically 
the study of temperature radiations is simplest for a black body. Classical 
mechanics and the law of equipartition of energy given by classical statistics 
led to Rayleigh's formula, viz., 

$$I(\lambda, T) = c\,T\,\lambda^{-4}, \tag{1}$$

although Wien had derived from the classical thermodynamical principles 
and a speculative assumption of a Maxwellian distribution of velocities the 
alternative formula, viz., 

$$I(\lambda, T) = a\,\lambda^{-5}\,e^{-b/\lambda T}; \tag{2}$$

here $I(\lambda, T)\,d\lambda$ is the intensity of radiation for wave lengths between λ 
and λ + dλ at temperature T and a, b, c are certain constants. Rayleigh's 
formula was found to be true for large wave lengths but it gives very large 
values of $I(\lambda, T)$ for small values of λ producing an appreciable 
divergence between theory and experiment. The other formula agrees beautifully 
with experimental results only for small values of λT but for large values 
of the same the intensity function becomes too small. A generalized formula 
was required which would approach (2) for small values of λT, (1) for 
large values of λT and which would not violate the well-established Stefan's 
law of radiation. By means of his hypothesis Planck succeeded in deriving 
the successful formula: 

$$I(\lambda, T) = \frac{a\,\lambda^{-5}}{e^{b/\lambda T} - 1}. \tag{3}$$

It is in full accord with experimental results. It escapes the ultra-violet 
catastrophe produced by the Rayleigh formula and the infra-red disaster due to 
the other. 
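
The way (3) passes into (2) and (1) in the two limits can be checked with a short calculation. This is only a sketch: the constants a and b are set to unity for illustration, and the Rayleigh constant is taken as c = a/b, which is the value that the large-λT limit of (3) requires (an assumption made here so that the three formulae can be compared on one scale).

```python
import math


def planck(lam, temp, a=1.0, b=1.0):
    """Formula (3): a lam^-5 / (exp(b / (lam T)) - 1)."""
    return a * lam ** -5 / math.expm1(b / (lam * temp))


def wien(lam, temp, a=1.0, b=1.0):
    """Formula (2): a lam^-5 exp(-b / (lam T))."""
    return a * lam ** -5 * math.exp(-b / (lam * temp))


def rayleigh(lam, temp, a=1.0, b=1.0):
    """Formula (1) with the constant c taken as a/b (assumed for comparison)."""
    return (a / b) * temp * lam ** -4


temp = 1.0
small, large = 0.05, 100.0   # small and large values of lam * T

print("small lam T: Planck/Wien     =", planck(small, temp) / wien(small, temp))
print("large lam T: Planck/Rayleigh =", planck(large, temp) / rayleigh(large, temp))
print("large lam T: Wien/Planck     =", wien(large, temp) / planck(large, temp))
print("small lam T: Rayleigh/Planck =", rayleigh(small, temp) / planck(small, temp))
```

The first two ratios are close to unity, while the last two show Wien's formula falling far below (3) at large λT and Rayleigh's formula rising far above it at small λT.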

In his derivation of (3) Planck assumed that the mechanism responsible 
for radiation consists of linear oscillators performing simple harmonic motion. 
A particle of mass m executing simple harmonic motion in a straight line 
satisfies the equation, 

$$m \ddot x + \mu x = 0, \tag{4}$$

the force per unit mass being attractive and proportional to the distance 
from the origin. So 

$$x = a \cos \sqrt{\mu/m}\;t \tag{5}$$

if the time is reckoned from a suitable epoch. The total energy at any 
instant is given by 

$$E = \tfrac{1}{2}\,m \dot x^2 + \tfrac{1}{2}\,\mu x^2, \tag{6}$$

which is constant by the principle of energy. Denoting the frequency of the 
periodic motion by ν and hence the period by 1/ν we have 

$$\frac{1}{\nu} = \frac{2\pi}{\sqrt{\mu/m}}. \tag{7}$$

Hence from (6) and (7) 

$$E = A\,\nu \tag{8}$$

where 

$$A = 2\pi^2\,m\,a^2\,\nu. \tag{9}$$

It is interesting to note that as $p = m \dot x$, 

$$\oint p\,dx = \int_0^{2\pi/\sqrt{\mu/m}} \mu\,a^2 \sin^2\!\big( \sqrt{\mu/m}\;t \big)\,dt = A. \tag{10}$$
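
Relations (8), (9) and (10) can be confirmed numerically for any chosen oscillator. In the sketch below the values of m, μ and a are arbitrary assumptions, and the phase integral is evaluated by a midpoint sum over one period.

```python
import math


def phase_integral(m, mu, a, steps=20000):
    """Midpoint-rule estimate of the integral of p dx over one full period."""
    omega = math.sqrt(mu / m)
    dt = (2.0 * math.pi / omega) / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        x_dot = -a * omega * math.sin(omega * t)   # from x = a cos(omega t)
        total += (m * x_dot) * (x_dot * dt)        # p dx = m x_dot * x_dot dt
    return total


m, mu, a = 2.0, 8.0, 0.5                 # assumed illustrative values
nu = math.sqrt(mu / m) / (2.0 * math.pi) # frequency, from (7)
energy = 0.5 * mu * a * a                # (6) evaluated at the turning point
action = 2.0 * math.pi ** 2 * m * a * a * nu   # A as defined in (9)

print(phase_integral(m, mu, a), action, energy / nu)
```

All three printed numbers agree, which is the content of (8) and (10): the area of the phase-plane ellipse equals E/ν.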
In the problem of heat radiation the linear oscillators, representing the effect 
of a black body in equilibrium with radiation at a specific temperature, were 
supposed to exist in discrete states of energy so that 

$$A = nh, \tag{11}$$

where h is Planck's constant and n is a non-negative integer. Thus A could 
not be a continuous function and, in a state of equilibrium, energy was 
assumed to be absorbed or emitted in discrete quanta of multiples of hν. h, of 
course, has the small value $6.5 \times 10^{-27}$ in c.g.s. units. The assumption (11) 
gave (3). 

Originally, the assumption was made for no other reason except that of 
mathematical convenience and enough justification was found for it in the 
result (3) and in several others. What is curious, however, is that the 
expression for energy should have been split up in the particular manner of 
(8) and (9). The explanation lies in the significance of A. The area of the 
ellipse obtained by plotting the position and the momentum in one period 
in perpendicular directions is A, as the result (10) shows; and the far 
reaching consequences of Planck's hypothesis are due to the fact that A is one 
of the action invariants discussed in (I, 85-85'). As all conservative systems 
of classical mechanics are governed by a continuous action function, the 
implication of Planck's hypothesis that atomic phenomena are similarly governed 
by the quantum of action h appearing in (11), augured very well for 
subsequent research.² 

The integer n is an important feature of Planck’s hypothesis. Kronecker 
once said that God made integers the rest being the work of man. We do 
not know to what extent Planck was under the influence of Kronecker’s dictum 
when he thought of the famous hypothesis. A pure mathematician intimately 
acquainted with prime numbers, might have insisted that n must be a prime 
number. There is no a priori reason why n should be an integer. It cannot 
however be denied that, with our habits of thought and with our experience 
of nature, a ‘simpler’ statement of the occurrence of discrete values of energy 
is not possible. 

The full content of the quantum hypothesis is not described by (11). 
The existence of discrete states of energy for oscillators merely implies that 
absorption and emission are not continuous as far as their effects are 
observable. The more daring hypothesis was suggested by Einstein in 1905 
that radiation itself is discontinuous; the energy E of a radiation with 
frequency ν was supposed to be carried by unidirectional quanta satisfying the 
relation, 

$$h\nu = E. \tag{12}$$

Ten years of experimental work on the photoelectric effect of light had to 
pass before (12) was accepted. In 1928 Lawrence and Beams tried to 
measure the lengths of the unidirectional quanta of radiation. They 
concluded that the lengths must be less than 1O- 0 cm. 

Einstein's hypothesis (12) is vitally connected with another fundamental 
relation usually attributed to him, viz., 

$$E = mc^2 \tag{12}$$

where E is the energy and m, the rest mass of a particle. The atomicity 
of matter is naturally expected therefore to imply the atomicity of energy. 

An immediate application of Planck's formula of heat radiation was 
made by Einstein to modify, as experimental results required, the Dulong 
and Petit law of atomic heats. The precise form of Planck's law (3) is 

$$I(\lambda, T) = \frac{8\pi c h}{\lambda^5}\,\frac{1}{e^{ch/\lambda k T} - 1} \tag{13}$$

in the usual notation. Since 

$$\lambda \nu = c \tag{14}$$

the density of radiation, $J(\nu, T)$, in the frequency range dν is given by 

$$J(\nu, T)\,d\nu = -\,I(\lambda, T)\,d\lambda = \frac{8\pi h \nu^3}{c^3}\,\frac{1}{e^{h\nu/kT} - 1}\,d\nu$$

so that 

$$J(\nu, T) = \frac{8\pi h \nu^3}{c^3}\,\frac{1}{e^{h\nu/kT} - 1}. \tag{15}$$

The mean energy of an oscillator W is connected with this expression by 

$$W = \frac{c^3}{8\pi \nu^2}\,J(\nu, T) = \frac{h\nu}{e^{h\nu/kT} - 1}. \tag{16}$$

For oscillators with three degrees of freedom the mean energy would be thrice 
this amount. Hence the heat content per gramme-mole is 

$$E = \frac{3 N h \nu}{e^{h\nu/kT} - 1}, \tag{17}$$

N being Avogadro's number. For the atomic heat at constant volume we 
get the formula, first given by Einstein, 

$$C_v = \frac{dE}{dT}. \tag{18}$$

According to the Dulong and Petit law the atomic heats of all 
substances are the same at all temperatures. It had been discovered by 
experimenters that the atomic heats are usually much smaller at low temperatures 
and this fact could be explained by using Planck's formula for the mean energy 
of an atom in place of the law of equipartition of energy. By virtue of the 
latter 

$$E = 3 N k T$$

which gives a constant value $C_v$. 

With the help of the familiar formulae, 

$$R = kN$$

and $Ne = 9658 \times 3 \times 10^{10}$ e.s.u.; 

the constant k of Planck's formula was used to determine the Avogadro 
number and the charge of the electron e. 
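
The behaviour of the atomic heat that follows from (17) and (18) can be tabulated directly. Differentiating (17) with respect to T gives $C_v = 3Nk\,x^2 e^x/(e^x - 1)^2$ with $x = h\nu/kT$; the sketch below evaluates the ratio $C_v/3Nk$ and is only an illustration (the function name and the sample values of x are assumptions).

```python
import math


def atomic_heat_ratio(x):
    """C_v / (3 N k) obtained by differentiating (17); x = h nu / (k T)."""
    return x * x * math.exp(x) / math.expm1(x) ** 2


# Small x is high temperature, large x is low temperature.
for x in (0.01, 1.0, 5.0, 20.0):
    print(x, atomic_heat_ratio(x))
```

At high temperatures the ratio tends to unity, which is the Dulong and Petit value $C_v = 3Nk = 3R$, while at low temperatures it falls towards zero, as the experiments on atomic heats required.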

In spite of these successes the Planck hypothesis was looked upon with 
considerable suspicion for several years afterwards. It appeared more as a 
mathematical trick than as a fundamental physical truth. There was an 
objection to the quantum theory of radiation on the score that unidirectional 
quanta could not explain interference of waves. That the oscillators do not 
satisfy the law of equipartition of energy was also considered to be an 
objectionable feature. A serious difficulty was pointed out by Lorentz in 
1910 who showed that in a weak field of radiation an appreciable time 
would be needed for the absorption of a quantum. On the other hand, it was 
assumed in Planck's proof that the time of absorption is negligible. 

In view of these difficulties Planck himself did not countenance Einstein's 
theory of light quanta in the beginning and his own original hypothesis he 
modified making the oscillators emit discontinuously but absorb continuously. 
It was shown that Planck's formula of heat radiation could be deduced even 
on the modified hypothesis. Two years later, that is, in 1913, Planck 
returned to the classical hypothesis that both emission and absorption are 
continuous. Later he reverted again to the second hypothesis. 

It is no longer necessary to postulate either linear oscillators or 
unidirectional quanta of radiation to account for the observed facts of heat radiation 
or the photoelectric phenomena. Like the grin alone that has survived of 
the Cheshire cat the hypotheses of Planck and Einstein, (11) and (12), 
retain their importance as mathematical formulae but the mechanism which 
suggested them is no longer in fashion. In fact physical processes as they 
are understood to-day in the light of wave mechanics are much too subtle 
to follow any mechanical model satisfying classical laws. When we speak 
of waves and linear oscillators in the quantum theory of radiation the 
existence of such mechanism is not implied, but the terms are used because 
they are very suggestive descriptions of certain mathematical formulae and 
equations. 
2. EINSTEIN'S RELATIONS 


Einstein's relation³ for light quanta, 

$$E = h\nu \tag{12}$$

should be studied along with the vector relation, also given by him for light, 

$$\mathbf{p} = h\,\boldsymbol{\tau} \tag{19}$$

where $\mathbf{p}$ is the momentum vector and $\boldsymbol{\tau}$ the wave-number vector. These 
equations bring out the correspondence between the terms of corpuscular 
representation such as E and $\mathbf{p}$ and those of the wave representation, ν and 
$\boldsymbol{\tau}$. To understand the correspondence consider the relativistic vector for a 
particle, 

$$m v_x, \quad m v_y, \quad m v_z, \quad i\,mc \;(= iE/c) \tag{20}$$

where 

$$m = m_0 \left( 1 - \frac{v_x^2 + v_y^2 + v_z^2}{c^2} \right)^{-1/2}$$

and $m_0$ is the rest mass. Similarly, there is the vector 

$$k_x, \quad k_y, \quad k_z, \quad i\,\nu/c \tag{21}$$

associated with the plane wave 

$$\phi = \cos 2\pi\,(x k_x + y k_y + z k_z - \nu t). \tag{22}$$

It is known that a Lorentz transformation leaves 

$$m\,(x v_x + y v_y + z v_z - E t/m) \tag{23}$$

invariant. Hence the phase of the plane wave can remain unchanged under 
a Lorentz transformation in form and value if the two four-vectors satisfy 

$$\frac{k_x}{m v_x} = \frac{k_y}{m v_y} = \frac{k_z}{m v_z} = \frac{\nu}{E} = \frac{1}{h}, \tag{24}$$

where h is a function of the invariants, 

$$k_x^2 + k_y^2 + k_z^2 - \frac{\nu^2}{c^2} \quad \text{and} \quad m^2 \left( v_x^2 + v_y^2 + v_z^2 - \frac{E^2}{m^2 c^2} \right).$$

For light both the invariants are zero (see I, 98) and h may be taken as a 
constant, Planck's constant, in particular. Putting 

$$\boldsymbol{\tau} = (k_x, k_y, k_z), \qquad \mathbf{p} = m\,(v_x, v_y, v_z) \tag{25}$$

we have the two relations stated in the beginning. They are assumed to be 
true for corpuscles of matter as well. On this assumption was based the 
original work of de Broglie. An important contribution by Einstein to 
physics was the derivation of (12) and (19) by reconciling the particle and 
wave representations of light; the extrapolation of these results for small 
particles of matter was done later by de Broglie. 
Supplement

In case the invariance of (23) is not obvious the following straightforward
proof⁴ may be read.

For a Lorentz transformation,

$$x' = \frac{x - ut}{(1 - u^2/c^2)^{1/2}}, \qquad y' = y, \qquad z' = z, \qquad t' = \frac{t - ux/c^2}{(1 - u^2/c^2)^{1/2}},$$

$$v_x' = \frac{v_x - u}{1 - u v_x/c^2}, \qquad v_y' = \frac{v_y\,(1 - u^2/c^2)^{1/2}}{1 - u v_x/c^2}, \qquad v_z' = \frac{v_z\,(1 - u^2/c^2)^{1/2}}{1 - u v_x/c^2},$$

where $v_x = dx/dt$, $v_x' = dx'/dt'$, etc. Also $v^2 = v_x^2 + v_y^2 + v_z^2$,
and similarly $v'$ is defined. It may be verified that

$$x' v_x' + y' v_y' + z' v_z' - c^2 t' = \frac{(x v_x + y v_y + z v_z - c^2 t)\,(1 - u^2/c^2)^{1/2}}{1 - u v_x/c^2}$$

and that

$$1 - \frac{v'^2}{c^2} = \frac{(1 - u^2/c^2)\,(1 - v^2/c^2)}{(1 - u v_x/c^2)^2}.$$

Hence

$$m'\,[\,x' v_x' + y' v_y' + z' v_z' - c^2 t'\,] = m\,[\,x v_x + y v_y + z v_z - c^2 t\,].$$
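The invariance can also be checked numerically. The following is only a sketch: it assumes units in which c = 1, a boost of speed u along Ox, and arbitrarily chosen values for the event and the particle velocity. The function names are illustrative and not part of the text.

```python
import math

C = 1.0  # velocity of light; units with c = 1 are an assumption of this sketch


def boost(x, y, z, t, vx, vy, vz, u):
    """Lorentz transformation with relative speed u along Ox.

    Returns the transformed coordinates and velocity components in the
    order (x', y', z', t', v_x', v_y', v_z').
    """
    g = math.sqrt(1.0 - u * u / C**2)   # (1 - u^2/c^2)^(1/2)
    d = 1.0 - u * vx / C**2             # 1 - u v_x / c^2
    return ((x - u * t) / g, y, z, (t - u * x / C**2) / g,
            (vx - u) / d, vy * g / d, vz * g / d)


def mass(m0, vx, vy, vz):
    """m = m0 (1 - v^2/c^2)^(-1/2)."""
    return m0 / math.sqrt(1.0 - (vx * vx + vy * vy + vz * vz) / C**2)


def expression_23(m0, x, y, z, t, vx, vy, vz):
    """m (x v_x + y v_y + z v_z - c^2 t), i.e. (23) with E = m c^2."""
    return mass(m0, vx, vy, vz) * (x * vx + y * vy + z * vz - C**2 * t)


event = (0.7, -1.2, 0.4, 2.5)    # x, y, z, t (illustrative values)
velocity = (0.3, 0.2, -0.1)      # v_x, v_y, v_z (illustrative values)
u = 0.6

before = expression_23(1.0, *event, *velocity)
after = expression_23(1.0, *boost(*event, *velocity, u))
print(before, after)  # the two numbers should agree to rounding error
```

Any other event, velocity (with v < c) and boost speed (with u < c) may be substituted; the two printed values should continue to agree.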


3. THE SUCCESS OF THE QUANTUM HYPOTHESIS 

In discussing the failure of classical mechanics we have described three
difficulties which the theory could not successfully combat. As we have
already seen Planck’s hypothesis explains perfectly the observed rise and
fall of the intensity of black-body radiation with wave length. It was
natural to expect that the same hypothesis might account for the statistical
situation also. For the resistance of metals and for the heat of rotation of
diatomic molecules either the law of distribution of energy with temperature
as given by Planck’s theory was taken as true or the assumption
$\oint p\,dq = nh$, which Planck made in the particular case of an oscillator,
was used to regulate a more complicated mechanism. A certain measure of
success was attained in most of these cases but nowhere was it as complete
as in the problem of heat radiation. It looked as if a good clue was obtained
in Planck's hypothesis but that more clues were needed before the statistical
properties of atoms and molecules could be clearly understood.
The third difficulty, caused by the remarkable stability of atoms and
molecules, was brought nearer home by the new experimental findings.
Already a model of the atom was proposed by J. J. Thomson according to
which the positive charge is distributed in a sphere of radius $10^{-8}$ cm., and
the negative charges which are electrons are elastically bound to it in
positions of stable equilibrium. When the atom is slightly disturbed negative
charges execute small oscillations which were supposed to explain the spectra.
However this failed to provide an explanation of the simple formula of
Balmer lines of hydrogen, viz.,

$$\nu = R\left(\frac{1}{2^2} - \frac{1}{n^2}\right),$$

where $\nu$ is the frequency, R the Rydberg constant and n, a positive integer
exceeding 2. Nor was the Stark effect on spectral lines in an electrical field,
discovered during the early formative period of Bohr’s theory, understood
by means of the Thomson model. Moreover, Rutherford’s experiments on
the scattering of α particles showed very pronounced deviations in their
orbits which could be attributed only to the supposed concentration of
positive charges in the centre of the atom. The experiments being of a decisive
nature the Thomson model was replaced by Rutherford’s solar-system model.
The idea of a positively charged nucleus with a number of satellite electrons
goes back to Rutherford. Although the scattering experiments strongly
favoured this model there were serious objections against it. For instance
the dynamical stability of a satellite electron is not possible if the classical
theory of radiation due to moving charges is true. At this stage Bohr entered
upon the scene and although he could not meet this particular objection
he showed how Rutherford's model, modified by the quantum hypothesis,
can explain the hydrogen spectra, the Stark effect and several other
phenomena. Incidentally it may be observed that Rutherford’s scattering formula
provided a stimulus in the right direction, in 1913, when Bohr’s theory
was in the offing and an effective test in 1926, for verifying Born’s
probability interpretation of Schrodinger’s wave function.
III FIRST QUANTUM THEORIES OF MATTER AND
RADIATION


1. Postulates and Hypotheses of Bohr’s Theory. 2. Successes of Bohr’s Theory. 
3. Einstein’s Theory of Radiation. 4. Failures of the Two Theories. 

1. POSTULATES AND HYPOTHESES OF BOHR’S THEORY 

Bohr’s theory, as it was first advanced in 1913, had to be considerably 
elaborated by Sommerfeld and others to meet with the growing demands 
of spectral evidence. Let it be clearly understood therefore that it is the 
postulates and hypotheses of the more elaborate theory, which had grown to 
its full stature in 1923, that will be considered here. They 1 are as follows : 

(A) An atom can exist in any one of a possible set of stationary states,
    each state having its own constant of energy.

(B) When from a state of higher energy $E_2$ an atom falls to a state of
    lower energy $E_1$ a quantum of radiation is emitted with
    frequency $\nu$, given by

    $$h\nu = E_2 - E_1. \tag{1}$$

    Similarly, an atom rises to a state of higher energy with the
    absorption of a quantum of radiation, and the increase in energy
    equals the energy of the quantum.

(C) The laws of classical mechanics apply to the atom. But in the
    case of conditionally periodic systems the action variables satisfy
    the quantum conditions of Wilson and Sommerfeld,

    $$\oint p\,dq = I = nh, \tag{2}$$

    n being a non-negative integer. There are as many quantum
    conditions as there are independent co-ordinates. For degenerate
    systems the reduced number of independent action variables
    satisfy the analogous conditions

    $$J = nh. \tag{3}$$

(D) The moving charges in the atom obey the laws of classical
    electrodynamics. The state of polarization of an emitted quantum of
    radiation and the question whether a transition is possible or
    forbidden are decided by Bohr’s correspondence principle.
Every atom is distinguished by its atomic number which is some positive
integer m; that means it has a positive charge me in the nucleus and m
electrons, each with a negative charge −e, rotating about it. Thus H has the
atomic number 1, He has 2, Li has 3 and so on. The original classification
of the elements of the periodic table was made by Mendeleeff on the basis of
atomic weights but Moseley’s conclusive studies of atomic spectra made it
abundantly clear in 1913 that the periodic properties are dependent upon the
atomic number. These facts were used in developing the theory of the atom
on the assumptions stated above.

There is enough evidence that atoms exist in stationary states. That a 
stationary state can be characterized simultaneously by absence of radiation 
and moving charges is amply borne out by paramagnetic atoms like bismuth 
and ferromagnetic atoms like iron, cobalt and nickel. The apparent paradox 
is now resolved by the assumption of electronic spin. 

In elucidating (B) some important considerations have to be presented.
When the energy of the atom is the least possible it is said to be in the ground
state; any other possible state is an excited state. As will be seen later two or
more states may correspond to the same energy. The various states are usually
characterized by different electronic orbits. Certain transitions from one level
to another are highly improbable and they are called ‘forbidden’. The
theoretically forbidden lines may be detected with finer instruments. Whenever
this has happened it has given a further proof of the accuracy of wave
mechanics, these smaller and ‘neglected’ terms of calculation accounting for the
fainter and ‘forbidden’ lines. It is possible, however, for an atom to change
from one excited state to another, both characterized by the same energy, the
distinguishing feature being different electronic orbits in the two states. Such a
transition is radiationless and is described as the Auger effect. What actually
happens is that a normal atom exposed to X-rays gets into an excited state from
which it sinks to a state of lower energy by the emission of a quantum. This
quantum is further absorbed by one of the outer electrons usually making
its orbit hyperbolic; and moving in the hyperbolic orbit the electron speeds
off to infinity leaving the atom ionized. Thus the transition from one state
to another is marked by an absence of radiation.

The quantum numbers are introduced by (2) and (3). For conditionally
periodic systems the action variables I have been proved to be adiabatic
invariants. So the quantum numbers also are adiabatic invariants of such a
system. Hence the same quantum numbers can be used for describing the
various excited states of hydrogen, helium or any other atom. The adiabatic
invariants of a system are certain unchanging structural relations which exist
even when the system changes, very slowly no doubt, through the variation
of certain physical parameters. The quantum numbers, therefore, play the role
of the tensor invariants of general relativity in the present theory. Any
conditionally periodic system can be shown to be obtainable from a system of
harmonic oscillators by a slow variation of certain parameters. Thus the
Wilson-Sommerfeld conditions (2) are equivalent to Planck’s hypothesis for an
oscillator. In the case of a degenerate system the smaller number of
independent action variables J are shown by Burgers to be adiabatic invariants.

It has been already shown that for mechanical systems such as those
considered by Bohr

$$\delta E = \sum \omega\,\delta I. \tag{4}$$

For a transition from one set of large quantum numbers n to another set n'
such that n − n' is small compared to n and n' we can write (4) in the form

$$\delta E = h \sum (n' - n)\,\omega,$$

or, using (1),

$$\nu = \sum (n' - n)\,\omega. \tag{5}$$

Hence for large quantum numbers the Bohr frequencies are the same as those
given by the classical theory. But (5) would not be true for small quantum
numbers as in such a case $\delta I$ would be comparable to I if expressed as
$\tau h$ where $\tau$ is an integer, and $\omega$ cannot be considered to be
unchanged for a finite variation in I. However, by Bohr’s correspondence
principle, (5) is assumed to be true for small quantum numbers as well.
Secondly, only certain frequencies $\sum \tau\omega$ appear in the Fourier
expansions of the co-ordinates of a mechanical system. The same frequencies
$\nu$ are supposed to be allowed in Bohr’s theory. Hence the transition from
one level to another occurs only if n' − n has the same set of values as
$\tau$ in any of the frequencies. Thus arise the selection rules of Bohr’s
theory. In this manner the correspondence principle restricts the possible
transitions between two levels, the allowed frequency of emitted or absorbed
radiation being a possible frequency according to the classical theory.
It also makes the quantum condition (1) equivalent to the classical relation
(4) for large quantum numbers. The polarization of emitted radiation is
determined by calculating the direction of the electric intensity vector as in
the classical theory, the frequency $\nu$ being connected with the quantum
numbers by (5).
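The agreement of the Bohr and the classical frequencies for large quantum numbers may be illustrated numerically. The sketch below assumes the Coulomb levels obtained in the next section, $E = -2\pi^2 m Z^2 e^4/J^2$ with $J = nh$, so that the classical frequency is $\omega = dE/dJ$. All frequencies are expressed in units of $2\pi^2 m e^4/h^3$, and the function names are illustrative only.

```python
def bohr_frequency(n, s, z=1):
    """Frequency of the transition n -> s from h nu = E_n - E_s,
    in units of 2 pi^2 m e^4 / h^3."""
    return z * z * (1.0 / s**2 - 1.0 / n**2)


def classical_frequency(n, z=1):
    """Classical frequency omega = dE/dJ evaluated at J = n h,
    in the same units; for E proportional to -1/J^2 this is 2 Z^2 / n^3."""
    return 2.0 * z * z / n**3


# Ratio of the Bohr frequency for n -> n - 1 to the classical frequency.
ratios = {n: bohr_frequency(n, n - 1) / classical_frequency(n)
          for n in (2, 10, 100, 1000)}

for n, ratio in ratios.items():
    print(n, ratio)
```

For n = 2 the two frequencies differ by a factor of three, while for n = 1000 they agree to within a fraction of one per cent, which is the content of the statement that (5) holds for large quantum numbers.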

At the common boundary of the microscopic and macroscopic domains the
equations of classical mechanics must be equivalent to those of Bohr’s theory.
This is achieved by the correspondence principle. The discontinuity of
quantum process and the continuity of classical laws are apparently irreconcilable
but the correspondence principle harmonises the two at the meeting ground
of the two theories. Logic assigns an important place to this principle in
any theory of atomic laws; and so we shall have to discuss it again when
we consider Schrodinger’s and Dirac’s theories and also certain questions of
radiation.
2. SUCCESSES OF BOHR’S THEORY


The outstanding successes 8 of Bohr’s theory will now be briefly considered. 
For our purpose the following topics need be taken up : 

(1) The spectra of hydrogen-like atoms.

(2) The Stark effect. 

(3) Collision experiments of Franck and Hertz. 

By a hydrogen-like atom is meant one with a nucleus of mass M and
positive charge Ze, there being one satellite electron of mass m and charge
−e. Neutral hydrogen, singly ionized helium, He⁺, and doubly ionized lithium,
Li⁺⁺, are examples of such an atom. For hydrogen, M/m = 1845, approximately.
M being in general several thousand times as great as m the motion of the
nucleus may be neglected in the first instance. Assuming the Coulomb law
of electrostatic attraction we have for the kinetic energy and the potential,

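$$T = \tfrac{1}{2} m\,(\dot r^2 + r^2 \dot\theta^2), \qquad V = -\frac{Ze^2}{r},$$

$$p_1 = \frac{\partial T}{\partial \dot r} = m \dot r, \qquad p_2 = \frac{\partial T}{\partial \dot\theta} = m r^2 \dot\theta.$$

Hence the Hamiltonian is

$$H = T + V = \frac{1}{2m}\left[p_1^2 + \frac{p_2^2}{r^2}\right] - \frac{Ze^2}{r}. \tag{6}$$

If the constant of energy is $\alpha_1$ the Hamilton-Jacobi equation becomes

$$\frac{1}{2m}\left[\left(\frac{\partial F}{\partial r}\right)^2 + \frac{1}{r^2}\left(\frac{\partial F}{\partial \theta}\right)^2\right] - \frac{Ze^2}{r} = \alpha_1. \tag{7}$$

Putting

$$p_2 = \frac{\partial F}{\partial \theta} = \alpha_2, \qquad p_1 = \frac{\partial F}{\partial r} = \left[2m\left(\alpha_1 + \frac{Ze^2}{r}\right) - \frac{\alpha_2^2}{r^2}\right]^{1/2}.$$

Hence the quantum conditions are

$$I_2 = \oint p_2\,d\theta = 2\pi\alpha_2 = n_2 h, \tag{8}$$

$$I_1 = \oint p_1\,dr = \oint dr\left[2m\left(\alpha_1 + \frac{Ze^2}{r}\right) - \frac{\alpha_2^2}{r^2}\right]^{1/2} = n_1 h. \tag{9}$$

Eliminating $\alpha_2$ between the last two equations we get for the total energy,

$$E = \alpha_1 = -\frac{2\pi^2 m Z^2 e^4}{n^2 h^2}, \tag{10}$$

where

$$n = n_1 + n_2. \tag{11}$$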

As we have shown earlier this is a case of degeneracy with only one action
invariant $J = I_1 + I_2$. The frequency of emitted radiation is given by

$$\nu = R c Z^2\left(\frac{1}{s^2} - \frac{1}{n^2}\right), \tag{12}$$

where

$$R = \frac{2\pi^2 m e^4}{c h^3} \tag{13}$$

and s and n are positive integers so that n > s. R is the Rydberg constant
whose value agrees with the empirical estimate of the same. For Z = 1, s = 1
gives the Lyman frequencies and s = 2, the Balmer series.
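As a rough numerical illustration, the Rydberg constant may be evaluated from $R = 2\pi^2 m e^4/(c h^3)$ and the line frequencies from $\nu = RcZ^2(1/s^2 - 1/n^2)$. The values of the constants below are present-day figures in Gaussian C.G.S. units; they are an assumption of this sketch and are not quoted in the text. The nucleus is taken to be at rest.

```python
import math

# Present-day values in Gaussian C.G.S. units (assumed, not from the text).
M_E = 9.1093837e-28     # mass of the electron, g
E_CH = 4.80320471e-10   # electronic charge, e.s.u.
H = 6.62607015e-27      # Planck's constant, erg sec
C = 2.99792458e10       # velocity of light, cm/sec


def rydberg():
    """R = 2 pi^2 m e^4 / (c h^3), in cm^-1, for a nucleus at rest."""
    return 2.0 * math.pi**2 * M_E * E_CH**4 / (C * H**3)


def frequency(n, s, z=1):
    """nu = R c Z^2 (1/s^2 - 1/n^2), in sec^-1, for the transition n -> s."""
    return rydberg() * C * z**2 * (1.0 / s**2 - 1.0 / n**2)


R = rydberg()
h_alpha_wavelength_cm = C / frequency(3, 2)   # first Balmer line, n = 3 -> s = 2

print(R)                      # roughly 1.097e5 cm^-1
print(h_alpha_wavelength_cm)  # roughly 6.56e-5 cm
```

The first Lyman line is obtained from `frequency(2, 1)`, and the lines of ionized helium by setting `z=2`.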

If the motion of the nucleus is not ignored the value of the Rydberg
constant becomes

$$R = \frac{2\pi^2 e^4 m}{h^3 c\left(1 + \dfrac{m}{M}\right)}. \tag{14}$$

As M increases with the atomic number the constant increases in value. This
is in perfect agreement with the results of spectroscopic observations.


A closer examination of the hydrogen spectra by a spectroscope of high
resolving power however reveals a fine structure which could not be explained
by the simple formula (12). Sommerfeld showed that if we start with the
relativistic Hamiltonian, in place of (6), a formula for the fine structure
is obtained.


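He started with the Lagrangian

$$L = T - V = m c^2\left[1 - (1 - \beta^2)^{1/2}\right] + \frac{Ze^2}{r}, \tag{15}$$

getting

$$p_1 = \frac{\partial L}{\partial \dot r} = \frac{m \dot r}{(1 - \beta^2)^{1/2}}, \qquad p_2 = \frac{\partial L}{\partial \dot\theta} = \frac{m r^2 \dot\theta}{(1 - \beta^2)^{1/2}},$$

where $\beta^2 = (\dot r^2 + r^2\dot\theta^2)/c^2$.

The Hamiltonian is

$$\dot r\,p_1 + \dot\theta\,p_2 - L = c\left(m^2 c^2 + p_1^2 + \frac{p_2^2}{r^2}\right)^{1/2} - m c^2 - \frac{Ze^2}{r}. \tag{16}$$

Substituting $\partial F/\partial r$ for $p_1$ and $\partial F/\partial\theta$ for $p_2$ in the energy integral,

$$H = \alpha_1, \tag{17}$$

we have the Hamilton-Jacobi equation

$$\left(\frac{\partial F}{\partial r}\right)^2 + \frac{1}{r^2}\left(\frac{\partial F}{\partial \theta}\right)^2 = \frac{1}{c^2}\left(\alpha_1 + \frac{Ze^2}{r}\right)\left(\alpha_1 + \frac{Ze^2}{r} + 2 m c^2\right). \tag{18}$$

We can take

$$\frac{\partial F}{\partial \theta} = \alpha_2 \tag{19}$$

so that

$$I_1 = \oint \frac{\partial F}{\partial r}\,dr = \frac{2\pi Z e^2\,(m + \alpha_1/c^2)}{\left[-\alpha_1\,(2m + \alpha_1/c^2)\right]^{1/2}} - 2\pi\left(\alpha_2^2 - Z^2 e^4/c^2\right)^{1/2}, \tag{20}$$

$$I_2 = \oint \frac{\partial F}{\partial \theta}\,d\theta = 2\pi\alpha_2. \tag{21}$$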

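Eliminating $\alpha_2$ between (20) and (21) we get the energy E :

$$E = -\frac{m c^2 \alpha^2 Z^2 h^2}{2\,(I_1 + I_2)^2}\left\{1 + \frac{\alpha^2 Z^2 h^2}{(I_1 + I_2)^2}\left(\frac{1}{4} + \frac{I_1}{I_2}\right)\right\} \tag{22}$$

where $\alpha = 2\pi e^2/ch$.

$\alpha$ is the so-called fine-structure constant and its value is approximately 1/137.
The quantum conditions

$$I_1 = n_1 h, \qquad I_2 = n_2 h,$$

give

$$E = -\frac{m c^2 \alpha^2 Z^2}{2\,(n_1 + n_2)^2}\left\{1 + \frac{\alpha^2 Z^2}{(n_1 + n_2)^2}\left(\frac{1}{4} + \frac{n_1}{n_2}\right)\right\}. \tag{23}$$

It is clear from the expression (22) that this is not a degenerate system and
that two independent quantum numbers characterize each energy level.
Writing in the familiar notation,

$$n_1 + n_2 = n, \qquad n_2 = k \tag{24}$$

and

$$2\pi^2 m e^4/c h^3 = R, \tag{13}$$

$$E = -R h c Z^2\left\{\frac{1}{n^2} + \frac{\alpha^2 Z^2}{n^4}\left(\frac{n}{k} - \frac{3}{4}\right)\right\}. \tag{25}$$

Hence the line of frequency $\nu$ as given by (12) is replaced by a series of
lines with frequency $\nu + \Delta\nu$ so that

$$\nu + \Delta\nu = R c Z^2\left(\frac{1}{s^2} - \frac{1}{n^2}\right) + R c\,\alpha^2 Z^4\left[\frac{1}{s^4}\left(\frac{s}{k'} - \frac{3}{4}\right) - \frac{1}{n^4}\left(\frac{n}{k} - \frac{3}{4}\right)\right]$$

or

$$\Delta\nu = R c\,\alpha^2 Z^4\left[\frac{1}{s^4}\left(\frac{s}{k'} - \frac{3}{4}\right) - \frac{1}{n^4}\left(\frac{n}{k} - \frac{3}{4}\right)\right], \tag{26}$$

k and k' being the quantum numbers corresponding to $I_2$ in the two states. It
looks as if there is a double infinity of lines of emission corresponding to a
transition from the state n to the state s since k and k' are still free. But we
know from (24) and from the result that the zero value of $n_2$ makes the orbit
run through the origin that

$$0 < k \le n, \qquad 0 < k' \le s. \tag{27}$$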

There is a further restriction put on the relative values of k and k' by the
correspondence principle. For we know that the frequency which is given by

$$(s - n)\,\omega_1 + (k' - k)\,\omega_2 \tag{28}$$

must be a possible frequency according to the classical theory. The following
equations have now to be recalled :

♦ dF „ r dF % 
From the form of F we find that in the expansions of the coordinates of the
problem, x, y or r, $\theta$, frequencies only of the type

$$\tau\,\omega_1 \pm \omega_2 \tag{30}$$

enter. Hence the selection rules are derived by comparing (28) with (30) ;
| n − n' | can have any integral value while | k − k' | can have only the value
unity. Thus

$$k - k' = \pm 1. \tag{31}$$

The conditions (27) and (31) control the multiplicity of the values of $\Delta\nu$
as given by (26).
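The way in which the two conditions limit the number of components may be seen by direct enumeration. The sketch below takes the ranges to be 0 < k ≤ n and 0 < k' ≤ s and the selection rule to be k − k' = ±1. The level formula is taken in the form $E = -RhcZ^2\{1/n^2 + (\alpha^2 Z^2/n^4)(n/k - 3/4)\}$ with energies in units of Rhc and the fine-structure constant approximately 1/137. The function names are illustrative.

```python
ALPHA = 1.0 / 137.0   # approximate value of the fine-structure constant


def level(n, k, z=1):
    """Energy of the level (n, k) in units of R h c."""
    return -z**2 * (1.0 / n**2 + (ALPHA**2 * z**2 / n**4) * (n / k - 0.75))


def allowed_transitions(n, s):
    """Pairs (k, k') with 0 < k <= n, 0 < k' <= s and k - k' = +1 or -1."""
    return [(k, kp)
            for k in range(1, n + 1)
            for kp in range(1, s + 1)
            if abs(k - kp) == 1]


# Components of the first Balmer line, n = 3 -> s = 2.
h_alpha = allowed_transitions(3, 2)

# Displacement of each component from the unresolved line, in units of R c.
unresolved = 1.0 / 2**2 - 1.0 / 3**2
shifts = {(k, kp): (level(3, k) - level(2, kp)) - unresolved
          for (k, kp) in h_alpha}

for pair, shift in shifts.items():
    print(pair, shift)
```

Of the six combinations of k and k' that the ranges alone would allow for this line, only three survive the selection rule.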

The problem of the Stark effect is concerned with the motion of an
electron moving under the Coulomb law of attraction to the nucleus and a
uniform field of force. Neglecting the complications caused by the presence of
other electrons and the small relativistic correction the Hamiltonian is deduced
from

$$T + V = \tfrac{1}{2} m\left[\dot z^2 + \dot\rho^2 + \rho^2\dot\phi^2\right] - \frac{Ze^2}{r} + F_0\,e z, \tag{32}$$

where cylindrical co-ordinates $\rho$, $\phi$, z are used and $r^2 = \rho^2 + z^2$.
$F_0$ is the intensity and Oz the direction of the uniform field of force. When
these co-ordinates are used our results for a conditionally periodic system
cannot be applied ; the adiabatic invariants which are necessary for quantizing
the system do not exist. If, however, parabolic co-ordinates are used the quantum
conditions can be set down and the theory applied to solve the problem. This
naturally raised suspicion as to whether the quantum formulation of mechanical
problems depended on the choice of the co-ordinate system. The discussion
that followed was terminated by Einstein who showed in 1917 that the
quantum conditions can be expressed in a form independent of the co-ordinate
system chosen. When parabolic coordinates are used the expression for
energy becomes


$$E = -\frac{2\pi^2 m Z^2 e^4}{(I_1 + I_2 + I_3)^2} + \frac{3 F_0}{8\pi^2 m Z e}\,(I_1 - I_2)\,(I_1 + I_2 + I_3) \tag{33}$$

where $I_1$, $I_2$, $I_3$ are the action invariants. Putting $I_1 = n_1 h$,
$I_2 = n_2 h$, $I_3 = n_3 h$, we get E in terms of the quantum numbers. As the
system is degenerate we use the transformation,

$$I_1 + I_2 + I_3 = J_1 = nh, \qquad I_1 - I_2 = J_2 = kh, \tag{34}$$

so that

$$E = -\frac{2\pi^2 m Z^2 e^4}{n^2 h^2} + \frac{3 F_0 h^2 n k}{8\pi^2 m Z e}. \tag{35}$$
It is the correspondence principle that will have to be used to determine
the restrictions on possible transitions. That means the solution of the
Hamilton-Jacobi equation, F, is expressed as a function of $J_1$ and $J_2$.
The angle variables $w_1$, $w_2$ are then given by

$$w_1 = \frac{\partial F}{\partial J_1}, \qquad w_2 = \frac{\partial F}{\partial J_2}. \tag{36}$$

As

$$w_1 = \omega_1 t + \delta_1, \qquad w_2 = \omega_2 t + \delta_2, \tag{37}$$

it is possible to express x + iy and z as expansions of the form

$$\sum_{\tau_1,\,\tau_2} A_{\tau_1 \tau_2}\, e^{2\pi i\,(\tau_1 w_1 + \tau_2 w_2)}. \tag{38}$$

The conditions for polarization parallel to Oz are given by the expansion for
z. Those for circular polarization are found from the expansion for x + iy.

If the energy level is supposed to be characterized by the principal
quantum number only, any transition from one value of it to another gives not
merely one line, as the formula (35) shows, but more lines due to the second
quantum number k for each of the two states. This phenomenon, described
as the splitting of the n-s line, was found to be in complete accord with
observation in 1917 and it inspired great confidence in the theory.

If an atom consists of many electrons the energy of the atom as a whole
can be considered to be, by a mathematical artifice, the sum of the energies
of the satellite electrons. The artifice consists in compensating the
terms of electronic interaction. The quantum hypothesis implies the
existence of discrete levels of electrons corresponding to different energy values.
An atom becomes excited as one or more electrons of the ground state change
their levels under an external stimulus which imparts just the energy requisite
for excitation. This theoretical inference was quantitatively verified in a series
of collision experiments carried out by Franck and Hertz in 1914 and later
on, by other investigators also. Consider in its ground state or state of lowest
energy an atom like H, Li or Na in which there exists one outer electron, the
other electrons, if any, being bound in closed shells or sub-shells to the nucleus.
We can denote possible levels of increasing energy for this electron by positive
integers. The least energy needed for the electron to change its level is the
difference in energy of the two lowest levels between which a transition is
probable. The emission for a transition from the higher level to the other gives
the resonance line. Now Franck and Hertz found out that if electrons possess
an energy less than a critical value $E_c$ they suffer an elastic impact in a
collision with an atom of the type described above. If, however, the electrons
possess the energy $E_c$ there is an inelastic impact, the total energy is
communicated to the atom and its outer electron jumps on to the next higher level.
The observed value of $E_c$ agrees perfectly with the energy difference of the first
two levels. Agreement was also reached with the theory in more complicated
experiments dealing with other excited states of atoms obtained by
atom-electron collisions.
In cataloguing the successes of Bohr’s theory we have avoided any
reference to the properties of the periodic table. It is true that the classification
of the elements in the periodic table was done by means of four quantum
numbers, in 1925, when wave mechanics was still unknown. But giving credit
to Bohr’s theory for this explanation of the periods of the elements will not be
right. We will not discuss the quantum numbers and the periodic table here
as this question has to be taken up again. However, this is just the place
where one may record that many of the early successes of the theory
turned out to be failures in reality, when the underlying phenomena were
examined with greater instrumental precision and in the light of fresh evidence.
As an example we may consider the explanation of the fine structure of the
Hα line of hydrogen as obtained on Sommerfeld’s theory. The five
components as spectroscopically determined were given by the theory but the
selection rules derived from the correspondence principle did not permit all
the five of them. Moreover, if the fine structure were due only to the
difference in relativistic effects on the masses of the moving nucleus and electron
the fine structure of Na could not be accounted for. As was seen later the
explanation of the alkali doublet and of the five components of Hα rests on
another relativistic effect, the spin of the electron.

3. EINSTEIN’S THEORY OF RADIATION 

Einstein’s work on light quanta was hailed as a triumph of Newton’s
corpuscular theory in several orthodox circles. The quanta of radiation are
however essentially distinct from Newton’s corpuscles of light. The quanta
behave as particles always moving with the velocity of light ; their rest mass
is zero and in free space their energy and momentum (scalar) are connected
by the relation

$$E/c = p\,; \tag{39}$$

they do not obey the classical deterministic laws of motion. On the other
hand, Newton’s corpuscles moved with changing velocities in a field of force
or in a varying medium, their rest mass was not zero and they were subject
individually to the law of causation. Experiments have now definitely
disclosed that the quanta of radiation also carry angular momentum with them.
Thus a beam of light falling on a screen may set the latter in rotation through
the angular momentum of photons. The corpuscles of the classical theory
were not supposed to carry angular momentum individually in this manner.

The two duality relations⁴ on which Einstein’s theory is based have been
previously stated. They are

$$E = h\nu, \qquad \mathbf{p} = h\,\mathbf{k}. \tag{40}$$

These relations of energy and momentum combined with the principles of
conservation of momentum and energy explain the photoelectric phenomena
and the Compton scattering which could not be understood on the classical
wave theory of light. If the energy and momentum were carried by waves,
as was believed on the wave hypothesis, it appeared a mystery how even a
faint beam of monochromatic radiation of sufficiently high frequency could
pull out the outer electrons of an exposed metal when an intense beam only
of a frequency less than a critical value completely failed to do so. Einstein
succeeded in unravelling the mystery by means of the simple energy equation,

$$\tfrac{1}{2} m v^2 = h\nu - P, \tag{41}$$

where v is the velocity of the ejected electron of mass m, $h\nu$ is the energy
of the quantum of radiation and P is the binding energy of the outer electron
in the atom combined with the work done in pulling the electron against the
surface force of the metal. If $h\nu < P$ no electron can be pulled out. The
equation clearly shows that the effect is controlled not by the intensity of the
beam but by its frequency. Not only metals but non-metals and even inert
gases can have their electrons pulled out by radiation provided the frequency
is high enough. For radiations of the X-ray range, that is, for wave lengths
less than 100 Å and exceeding $10^{-12}$ cm, the Compton effect is witnessed in
their interaction with the inner or bound electron of the atom. Suppose that
an electron at rest is imparted a velocity v by a quantum of frequency $\nu_0$
and that as a result of the impact the scattered light has frequency $\nu$. The
conservation of energy is fulfilled by

$$h\nu_0 = h\nu + m c^2\left[(1 - v^2/c^2)^{-1/2} - 1\right], \tag{42}$$

while the conservation of momentum in two perpendicular directions in the
plane of motion is contained in

$$\frac{h\nu_0}{c} = \frac{h\nu}{c}\cos\theta + m v\,(1 - v^2/c^2)^{-1/2}\cos\phi, \tag{43}$$

$$0 = -\frac{h\nu}{c}\sin\theta + m v\,(1 - v^2/c^2)^{-1/2}\sin\phi. \tag{44}$$

$\theta$ is obviously the angle of scattering. Eliminating v and $\phi$ we get

$$\nu = \frac{\nu_0}{1 + 2\alpha\sin^2(\theta/2)}, \qquad\text{where}\qquad \alpha = \frac{h\nu_0}{m c^2}.$$

The wave-length shift due to scattering is

$$\Delta\lambda = \frac{c}{\nu} - \frac{c}{\nu_0} = \frac{2h}{mc}\,\sin^2\frac{\theta}{2}.$$

For scattering through the angle $\theta = \pi/2$ the shift is h/mc ; this term
is known as the Compton wave length. The greatest shift, 2h/mc, occurs for
$\theta = \pi$. Similarly, many other phenomena such as fluorescence and the
Raman effect, discovered in 1928, can be understood in terms of the light quanta
in so far as only the energy and momentum properties of radiation are involved
therein.
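The magnitude of the Compton shift is easily computed. The sketch below uses the relation $\nu = \nu_0/(1 + 2\alpha\sin^2(\theta/2))$ with $\alpha = h\nu_0/mc^2$, for an electron initially at rest. The numerical constants are present-day C.G.S. values and the incident wave length of 0.71 Å is merely an illustrative choice; neither is taken from the text.

```python
import math

# Present-day values in C.G.S. units (assumed, not from the text).
H = 6.62607015e-27     # Planck's constant, erg sec
M_E = 9.1093837e-28    # rest mass of the electron, g
C = 2.99792458e10      # velocity of light, cm/sec


def scattered_frequency(nu0, theta):
    """nu = nu0 / (1 + 2 alpha sin^2(theta/2)), alpha = h nu0 / (m c^2)."""
    alpha = H * nu0 / (M_E * C**2)
    return nu0 / (1.0 + 2.0 * alpha * math.sin(theta / 2.0)**2)


def wavelength_shift(nu0, theta):
    """Increase of wave length on scattering through the angle theta, in cm."""
    return C / scattered_frequency(nu0, theta) - C / nu0


nu0 = C / 0.71e-8                                  # incident wave length 0.71 A
shift_90 = wavelength_shift(nu0, math.pi / 2.0)    # equals h / (m c)
shift_180 = wavelength_shift(nu0, math.pi)         # equals 2 h / (m c)

print(shift_90, shift_180)
```

The shift does not depend on the incident frequency, so any other value of `nu0` gives the same two numbers apart from rounding.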

Einstein’s theory was not developed so as to provide laws of space-time
behaviour for the photons. In the early stages of Schrodinger's wave
mechanics it was customary to speak of the probability of position for a
photon. This concept was not fully worked out like the cognate one of the
probability for an electron and it did not make any contribution to the
understanding of radiation. Heitler makes the remark : “But there is no
indication that, for instance, the idea of the ‘position of a light quantum’ (or
the ‘probability for the position’) has any simple physical meaning”.

Photons are indistinguishable and classical statistics cannot therefore be 
applied to them. We may apply either the Fermi-Dirac statistics or the 
Einstein-Bose statistics. Logically the two are on equal footing and it is 
not known why photons follow the Einstein-Bose statistics. Planck’s law of
black-body radiation cannot be deduced if the alternative statistics is as- 
sumed for the quanta of radiation. 

This distinction between the three statistics can be indicated by means
of a trivial example. Suppose that there are two atoms A and B and three
places a, b, and c for them. The following different distributions are
possible :

Classical Statistics :

    A,  a  b  c  a  b  a  c  b  c
    B,  a  b  c  b  a  c  a  c  b

Einstein-Bose Statistics :

    A,  a  b  c  a  b  c
    B,  a  b  c  b  c  a

Fermi-Dirac Statistics :

    A,  a  b  c
    B,  b  c  a

According to the classical statistics as A and B are distinguishable there are
nine different arrangements possible. But if one cannot distinguish A from
B only the six different arrangements of the Einstein-Bose statistics are
observable. If, however, the Fermi-Dirac statistics is employed it means that
there is a further bar on the indistinguishable atoms which prevents two of
them from being in the same place. The respective distributions tell us that
the probability of one atom being in a and another in b is 2/9, 1/6 or 1/3
according to the nature of the statistics followed.
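The counting in this example can be reproduced mechanically. The following sketch enumerates the arrangements of two atoms over the three places a, b, c under each of the three statistics and evaluates the probability of finding one atom in a and the other in b, every arrangement being taken as equally probable. The variable and function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations, combinations_with_replacement, product

PLACES = "abc"

# Classical: A and B distinguishable, so ordered pairs (place of A, place of B).
classical = list(product(PLACES, repeat=2))

# Einstein-Bose: atoms indistinguishable, two may share a place.
einstein_bose = list(combinations_with_replacement(PLACES, 2))

# Fermi-Dirac: atoms indistinguishable, no two in the same place.
fermi_dirac = list(combinations(PLACES, 2))


def probability_a_and_b(arrangements):
    """Fraction of the arrangements with one atom in a and the other in b."""
    hits = [pair for pair in arrangements if sorted(pair) == ["a", "b"]]
    return Fraction(len(hits), len(arrangements))


for name, arrangements in (("classical", classical),
                           ("Einstein-Bose", einstein_bose),
                           ("Fermi-Dirac", fermi_dirac)):
    print(name, len(arrangements), probability_a_and_b(arrangements))
```

The same three library functions give the corresponding counts for any number of atoms and places by changing `PLACES` and the pair size.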

We will not indulge here in a derivation of Planck’s formula based on the
Einstein-Bose statistics. A property of the photons that was made use of
in the classical derivation ought to be noted here. Just like the electron the
photon has three coordinates to mark out its position in space. The electron
is further distinguishable by two possible spin values ± 1/2. Similarly two
perpendicular directions are associated with the photon to describe its state
of polarization. Both the directions are perpendicular to the line of motion.
Any other direction of polarization can then be described in terms of
probability.

Einstein's theory was by no means a complete theory of radiation. It 
had to be supplemented by Bohr’s rules of radiation deduced from classical 
electrodynamics and the correspondence principle. In spite of the many 
successes of Einstein's theory and of the correspondence principle it
appeared impossible to reconcile the two theories with each other. There was also
the difficulty of harmonising the corpuscular theory and the classical wave
theory of light. We shall now proceed to state the major difficulties that these 
first theories of matter and radiation confronted and that they could not sur- 
mount. 


4. FAILURES OF THE TWO THEORIES 

Nature reveals some phenomena and provokes the scientist to find the
logical key to them. “Guess” says Nature and the scientist puts forth an
intelligent guess. “You are quite clever but not right, so guess again”,
comes the challenge of Nature who discloses some more secrets and the
scientist guesses more accurately. And Nature smiles and nods patronizingly.
More secrets are revealed and each time the scientist uses all the resources
at his command to give a more accurate answer. This goes on till at last the
scientist finds his progress arrested by natural limitations. In the progress
of the modern theories of atomic physics one witnesses the game very
vigorously played.

Einstein's theory of the quanta provoked two questions : 

(1) What is radiation ? Waves or corpuscles ?

(2) Is radiation spherical in space or directed ? 

The first question arose because interference and diffraction phenomena
could be explained only on the classical wave theory of radiation while the
photoelectric and Compton effects definitely contradicted that theory. The
theory of photons, on the other hand, accounted for the effects but proved to
be definitely ineffective in the treatment of the other phenomena and simple
results of the wave theory such as the formula for the resolving power of a
telescope could not be derived in terms of photons. Briefly, the situation was
this ; the kinematical effects were explained by the wave theory, the dynamical
effects by the particle theory. A logical synthesis of the two is reached in
the quantum theory of radiation where the dynamical effects are realized by
the rules of quantization superposed on the classical theory giving the
kinematical effects. One thing the examination of this particular difficulty brought
to light before the advent of the more abstract theories of Heisenberg and
Dirac, viz., that the love of mechanical models must be sacrificed to the
greater love of logic if the complicated phenomena of matter and radiation
are to be understood as rationally interconnected.

According to the classical theory an electric oscillator radiates energy in 
spherical waves. At great distances from the oscillator the energy being spread 
over a large surface is very small per unit area. This goes against the observed 
result that the energy of radiation even at great distances is found to be 
hv in interaction with matter. Considering the energy exchanges in a state 
of equilibrium between Bohr atoms and radiation Einstein has established 
the law of black-body radiation. But considerations of momentum exchanges 
have shown that the radiation emitted by the Bohr atom has the character of 
a bullet. This is certainly in agreement with the photon hypothesis. What 
is curious however is that the character of the radiation should be correctly 
describable by means of selection and polarization rules when the momentum 
and energy are governed by the photon theory. In answer to the second ques- 
tion regarding the nature of radiation one had to admit that it might consist of 
tiny sets of directed waves at times and at times of spherical waves as given by 
classical electrodynamics. This question brings out more fully the significance
of our earlier remark that the kinematical effects are given by the wave theory 
although the dynamical ones call for the photon hypothesis. In the absence of 
a complete logical picture of the abstract processes of radiation our know- 
ledge of the subject was in an extremely unsatisfactory state in 1925. 

Bohr’s theory of the atom was considered unsatisfactory in 1925 for the 
following reasons : 

(1) Approximate solutions of the many-body problem showed a dis-
crepancy between the observed energy levels and the calculated ones. The
ionizing potential for He, for instance, was quite different from the calculated
one. (If V is the ionizing potential, $-Ve$ is the energy of the outermost elec-
tron.) Moreover, the electronic orbits as given by classical mechanics are
not dynamically stable.

(2) The existence of stationary states is in sharp contradiction to classi-
cal electrodynamics. An electron experiencing an acceleration loses energy
by radiation. The continuous loss of energy would ultimately bring the
electron to the nucleus. The main problem of the atom is to reconcile the
existence of stationary states with moving charges inside and this remained
untouched on account of the convenient but faulty assumption that electrons
move in their quantum orbits without a loss of energy by radiation.

(3) There was no rule for calculating the intensity of emitted lines. The 
correspondence principle gave accurate results only in special cases.

(4) The problem of scattering of light by an atom presented very grave
difficulties. An atom was treated as a multiply periodic system and it was sup-
posed to be disturbed by radiation. The result of the disturbance on the scat-
tered light was found to involve the orbital periods of the electrons. But the
periods are not observable and hence the solution of the problem of scatter-
ing was partly couched in unobservable terms. In the study of the problem
of dispersion in Bohr's theory, Heisenberg was very much impressed with the
absolute necessity of keeping all unobservables out of the theory on account
of the unwelcome appearance of orbital periods in the results calculated for
observational verification.

(5) A number of difficulties had accumulated during 1913-25 which 
could only be removed later by the hypothesis of the electron spin. There was 
the question of half quantum numbers without which the multiplet structure
and the molecular spectra could not be accounted for. The separation of
components in the anomalous Zeeman effect as compared to the separation of
the normal components introduced the mystery of the Landé g-factor. Theories
of ferromagnetism, electrical conductivity and valency were too rudimentary 
yet. 

To study all this from the standpoint of Bohr’s theory is purely of 
historical interest. It will suffice to remark here that Bohr’s theory had to 
be continually modified to accommodate new experimental facts during the 
twelve years preceding 1925 and it became increasingly clear about this time
(i) that the methods were not competent enough to cope with some situations,
(ii) that they led to wrong results definitely in certain cases and (iii) that
there was no hope of building a synthetic logical structure out of classical
mechanics and electrodynamics, ad hoc quantum hypotheses and the cor- 
respondence principle. For all these reasons the theory ceased to provide 
an incentive and a lead to further experimental work. 

More important conclusions followed from this than the one that Bohr’s
theory must be scrapped. The failures of science prove to be the earnest 
of a coming great success. The circumstances were favourable for a new 
outlook in physics, for the scientist to leap forth and hazard a new guess. 
Above everything else Bohr’s theory revealed the inadequacy of mechanical
models in physical representations. The microscopic domain cannot be ex-
plored part by part so that in every part laws of classical mechanics hold
good, the parts themselves being bound together by laws outside the pale of
the classical theory.

The flames of thought that were to illumine the field and remove these
obscurities were provided by de Broglie, Schrodinger, Born, Heisenberg and
Dirac.


IV WAVES AND PARTICLES


1. A Fundamental Analogy. 2. The Schrodinger Equation. 3. The two Inter-
pretations of $\psi$. 4. Some Collision Phenomena. 5. The Kepler Problem.

1. A FUNDAMENTAL ANALOGY 

The main function of analogies is to suggest an inner logical form. There 
is no harm in using them in scientific expression on the definite understanding 
that they prove nothing and that the attendant particulars are irrelevant. In 
the following is set forth, in detail, a fundamental analogy between geometrical 
optics and classical mechanics which has been responsible for the Schrodinger 
equation and a new philosophy of matter. First we expound the connection 
between the classical wave equation of physical optics and Fermat’s principle 
on which geometrical optics is based. Next it is shown that there is a com- 
mon symbolic expression for Fermat’s principle and for the principle of least 
action which is equivalent to the laws of classical mechanics so far as conser-
vative systems are concerned. The most ingenious step is the third which
establishes a connection between classical mechanics and Schrodinger’s equa-
tion, of the same logical form as that which connects geometrical optics with
the wave equation of light.

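Consider¹ the propagation of an optical phenomenon which is represented
by the function u(x, y, z, t). If V(x, y, z) is the velocity of propagation,
c/V is the refractive index and the wave equation,

$$\nabla^2 u - \frac{1}{V^2}\frac{\partial^2 u}{\partial t^2} = 0 \qquad (1)$$

is satisfied by u. It is possible to express u in the form

$$u(x, y, z, t) = A(x, y, z)\cos 2\pi\nu\,[t - \phi(x, y, z)] \qquad (2)$$

where $\nu$ is a constant. At any instant the surfaces of constant phase are

$$\phi(x, y, z) = \text{const.} \qquad (3)$$

The curves which are orthogonal to this family of surfaces are called rays.
The phase velocity is the velocity with which one has to travel along a ray
in order to keep to a given value of the phase, $t - \phi$. Hence if V' is the phase
velocity,

$$V'\,\delta t = \delta r \qquad (4)$$

where the components of $\delta r$ are related to $\delta t$ by the condition of the constancy
of phase, viz.

$$\delta t = \frac{\partial\phi}{\partial x}\,\delta x + \frac{\partial\phi}{\partial y}\,\delta y + \frac{\partial\phi}{\partial z}\,\delta z. \qquad (5)$$

Since

$$\delta x = \frac{\partial\phi}{\partial x}\,\delta r \Big/ \left\{\left(\frac{\partial\phi}{\partial x}\right)^2 + \left(\frac{\partial\phi}{\partial y}\right)^2 + \left(\frac{\partial\phi}{\partial z}\right)^2\right\}^{1/2} \qquad (6)$$

and $\delta y$ and $\delta z$ are similarly related to $\delta r$,

$$V' = \left\{\left(\frac{\partial\phi}{\partial x}\right)^2 + \left(\frac{\partial\phi}{\partial y}\right)^2 + \left(\frac{\partial\phi}{\partial z}\right)^2\right\}^{-1/2} = \frac{1}{|\mathrm{grad}\,\phi|}. \qquad (7)$$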

In general, V and V' are not equal. But the two are equal for the wave forms 
of geometrical optics given by 

$$u = C\cos 2\pi\nu\left[t - \int\frac{ds}{V}\right], \qquad (8)$$

C being a constant. Wave optics reduces to geometrical optics when the con- 
dition 


$$V = V' \qquad (9)$$

is fulfilled. From the form of u in (8) and from the definition of a ray it 
follows that rays satisfy 

$$\delta\int\frac{ds}{V} = 0, \qquad (10)$$

which is Fermat’s principle. Let us now express the condition (9) in quite
a different form for a future application. The solution (2) is the real part of

$$A\,e^{2\pi i(\Phi - \nu t)} \qquad (11)$$

where

$$\Phi = \nu\phi. \qquad (12)$$

The expression (11) satisfies the wave equation provided

$$-4\pi^2\,|\mathrm{grad}\,\Phi|^2 + 2\pi i\,\nabla^2\Phi + 4\pi^2\nu^2/V^2 = 0. \qquad (13)$$

Hence if $\nabla^2\Phi$ is negligible compared to $|\mathrm{grad}\,\Phi|^2$ we have from (7), (12) and
(13) that V = V'. Therefore, (9) is equivalent to the condition that

$$\nabla^2\Phi \,/\, |\mathrm{grad}\,\Phi|^2 \ \text{is negligible.} \qquad (14)$$

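If $\nu$ is the frequency, $\lambda = V/\nu$ is the wave-length. On differentiating $\Phi$ in an
arbitrary direction l, which makes an angle $\theta$ with the ray,

$$\frac{\partial\Phi}{\partial l} = \nu\,|\mathrm{grad}\,\phi|\cos\theta = \frac{\nu}{V}\cos\theta = \frac{\cos\theta}{\lambda};$$

$$\frac{\partial^2\Phi}{\partial l^2} = -\frac{\nu\cos\theta}{V^2}\frac{\partial V}{\partial l} = -\frac{\cos\theta}{\lambda V}\frac{\partial V}{\partial l}.$$

Hence the condition (14) is equivalent to the statement that

$$\left(\frac{1}{\lambda V}\frac{\partial V}{\partial l}\right)\Big/\left(\frac{1}{\lambda}\right)^2 \ \text{is negligible,}$$

or that $$\frac{\lambda}{V}\frac{\partial V}{\partial l} \ \text{is negligible,} \qquad (15)$$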

for any arbitrary direction. This means that the change in the velocity over 
a distance of the order of the wave length is negligible if the laws of geometri- 
cal optics are to be valid. Here is the reason why the interference and dif- 
fraction phenomena cannot be explained by means of geometrical optics. 

Assuming that Einstein’s equations for light,

$$E = h\nu \quad\text{and}\quad p = h\tau = h/\lambda = h\nu/V \qquad (16)$$

are valid also for material particles an infinitesimal change in the optical
phase becomes

$$2\pi\nu\left[dt - \frac{dr}{V}\right] = \frac{2\pi}{h}\,[E\,dt - p\,dr]. \qquad (17)$$

It follows that Fermat’s principle is equivalent to the principle of least action,

$$\delta\int p\,dr = 0. \qquad (18)$$

It was argued, therefore, by de Broglie that just as geometrical optics fails
when the condition (15) breaks down so also classical mechanics becomes
invalid in a domain of atomic dimensions where the phenomena must be
controlled by some equation analogous to (1). By (16)

$$V = h\nu/p, \qquad (19)$$

and so making this substitution in (1) the wave equation of mechanics was
first obtained as¹

$$\nabla^2 u - \frac{p^2}{h^2\nu^2}\frac{\partial^2 u}{\partial t^2} = 0. \qquad (20)$$

In relativistic mechanics, the equation of energy in its usual notation for a
particle is

$$E = \frac{m_0 c^2}{(1 - v^2/c^2)^{1/2}} + V(x, y, z) \qquad (21)$$

and the momentum p is given by

$$p = \frac{m_0 v}{(1 - v^2/c^2)^{1/2}} = \frac{1}{c}\left[(E - V)^2 - m_0^2 c^4\right]^{1/2}. \qquad (22)$$

Similarly, in Newtonian mechanics

$$E = \tfrac{1}{2} m_0 v^2 + V(x, y, z), \qquad (21')$$

$$p = m_0 v = [2 m_0 (E - V)]^{1/2}. \qquad (22')$$

Hence the wave equation is

$$\nabla^2 u + \frac{4\pi^2}{h^2 c^2}\left[(E - V)^2 - m_0^2 c^4\right] u = 0, \qquad (23)$$

or $$\nabla^2 u + \frac{8\pi^2 m_0}{h^2}\,(E - V)\,u = 0, \qquad (23')$$

according as the mechanics is relativistic or Newtonian.

Let us suppose that a solution of either of these wave equations is of
the form (11) with

$$\Phi(x, y, z) = \frac{1}{h}\,S(x, y, z). \qquad (24)$$

If the condition (14) holds good S satisfies

$$|\mathrm{grad}\,S|^2 = \frac{(E - V)^2}{c^2} - m_0^2 c^2 \qquad (25)$$

or $$|\mathrm{grad}\,S|^2 = 2 m_0 (E - V). \qquad (25')$$

S therefore plays the role of action in the Hamilton-Jacobi equation. Thus
when the condition (14) is satisfied the wave equation reduces to the crucial
equation of classical mechanics. In view of the fact that $\nabla^2\Phi$ is of the order
of $1/h$ while $|\mathrm{grad}\,\Phi|^2$ is of the order of $1/h^2$ the condition (14) is equivalent
to the statement that

h is negligible ; (26)

classical mechanics must be replaced by the more accurate wave theory when
h is not negligible.

It was Schrodinger who first pointed out that, since $\lambda = h/p$ for atomic
particles, $\lambda$ is of atomic dimensions if the experimental values of h and p are
substituted. Hence, by (15), in regions of atomic dimensions the wave
theory would be valid but not classical mechanics.
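
The order of magnitude in this remark can be checked numerically. The following is only a sketch in Python, assuming the Newtonian relation $p = (2mE)^{1/2}$ for a free electron, rounded values of the constants, and a hypothetical helper name `de_broglie_wavelength`; none of these details is taken from the lectures.

```python
import math

H = 6.626e-34    # Planck's constant in J s (rounded value, an assumption)
M_E = 9.109e-31  # electron rest mass in kg (rounded value, an assumption)
EV = 1.602e-19   # one electron-volt in joules (rounded value, an assumption)


def de_broglie_wavelength(kinetic_energy_ev, mass=M_E):
    """Return lambda = h/p in metres, with p = sqrt(2 m E) (Newtonian case)."""
    momentum = math.sqrt(2.0 * mass * kinetic_energy_ev * EV)
    return H / momentum


if __name__ == "__main__":
    # Energies typical of atomic electrons; the wavelengths come out
    # comparable with atomic dimensions (of the order of 1e-10 m).
    for energy in (1.0, 13.6, 100.0):
        wavelength = de_broglie_wavelength(energy)
        print(f"{energy:6.1f} eV -> {wavelength * 1e10:.2f} angstrom")
```

For a body of ordinary size the same function gives a wavelength many orders of magnitude below any measurable length, which is the sense in which h is "negligible" in classical mechanics.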

We can partially understand now how the new theory provides the re-
presentation of a particle in terms of waves. The optical analogy suggests
the identification of the group velocity with the particle velocity. For a
particle with velocity v, moving in free space,

$$E = \frac{m_0 c^2}{(1 - v^2/c^2)^{1/2}}, \qquad p = \frac{m_0 v}{(1 - v^2/c^2)^{1/2}}$$

and so

$$v\,dp = dE.$$

Hence, as $p = h\tau$ and $E = h\nu$

$$v = \frac{dE}{dp} = \frac{d\nu}{d\tau}. \qquad (27)$$

Thus v turns out to be the group velocity. We shall consider presently the
physical significance of this purely formal result.

We have also

$$\nu/\tau = E/p = c^2/v,$$

which is the wave velocity. As v < c, the wave velocity definitely exceeds c and
the waves of de Broglie cannot therefore be considered as physically real.

In the case of light, in ether,

$$E = pc$$

and so the wave velocity is

$$\nu/\tau = E/p = c$$

while the group velocity is also

$$d\nu/d\tau = dE/dp = c.$$

The two velocities are identical for radiation.
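
The two velocities can be compared numerically. The sketch below, in Python, assumes the free-particle relation $E^2 = p^2c^2 + m_0^2c^4$ implied by the expressions for E and p above; the function names and the finite-difference step are illustrative choices, not part of the original treatment.

```python
import math

C = 2.998e8  # velocity of light in m/s (rounded value, an assumption)


def energy(p, m0):
    """Relativistic energy of a free particle, E = sqrt(p^2 c^2 + m0^2 c^4)."""
    return math.sqrt((p * C) ** 2 + (m0 * C ** 2) ** 2)


def group_velocity(p, m0, rel_step=1e-6):
    """dE/dp by a central difference; equal to d(nu)/d(tau) as E = h nu, p = h tau."""
    dp = p * rel_step
    return (energy(p + dp, m0) - energy(p - dp, m0)) / (2.0 * dp)


def phase_velocity(p, m0):
    """Wave velocity nu/tau = E/p."""
    return energy(p, m0) / p


if __name__ == "__main__":
    m0 = 9.109e-31                      # electron rest mass in kg
    v = 0.6 * C                         # chosen particle velocity
    p = m0 * v / math.sqrt(1.0 - 0.36)  # corresponding momentum
    print("particle velocity:", v)
    print("group velocity   :", group_velocity(p, m0))
    print("wave velocity    :", phase_velocity(p, m0))
```

Setting `m0` to zero reproduces the case of radiation, for which both functions return c.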

2. THE SCHRODINGER EQUATION 
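The non-relativistic classical Hamiltonian of a particle is given in the
usual notation as

$$\frac{1}{2m}\left[p_x^2 + p_y^2 + p_z^2\right] + V = E. \qquad (28)$$

If we substitute for $p_x$, $p_y$ and $p_z$ the operators

$$p_x = -\frac{ih}{2\pi}\frac{\partial}{\partial x}, \quad p_y = -\frac{ih}{2\pi}\frac{\partial}{\partial y}, \quad p_z = -\frac{ih}{2\pi}\frac{\partial}{\partial z}, \qquad (29)$$

the energy integral leads to the Schrodinger equation (23') which may now
be written as

$$\nabla^2\psi + \frac{8\pi^2 m}{h^2}\,(E - V)\,\psi = 0, \qquad (30)$$

where $\psi$ is the Schrodinger function. This suggested the postulate that for
any conservative classical system for which

$$H(q, p) = E, \qquad (31)$$

the corresponding Schrodinger equation is

$$H\!\left(q,\, -\frac{ih}{2\pi}\frac{\partial}{\partial q}\right)\psi = E\psi. \qquad (32)$$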


The correspondence is generally responsible for a certain ambiguity if
Cartesian coordinates are not used. The point may be made clear by consi-
dering a particular term $p_r^2/r^2$ which appears in the Hamiltonian when polar
coordinates are used. The question now is whether this term is to be re-
placed by


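$$-\frac{h^2}{4\pi^2}\,\frac{1}{r^2}\frac{\partial^2}{\partial r^2} \quad\text{or}\quad -\frac{h^2}{4\pi^2}\,\frac{\partial}{\partial r}\left(\frac{1}{r^2}\frac{\partial}{\partial r}\right). \qquad (33)$$

The Hamiltonian of the wave theory does not therefore follow unambiguously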
from the classical theory if any arbitrary system of coordinates is used. We 
have pointed out earlier that a similar situation had arisen in the treatment 
of the Stark effect on Bohr’s theory as the Wilson-Sommerfeld conditions 
could not be expressed in terms of invariants in a perfectly general coordi-
nate system. The present question was tackled by Podolsky in 1928 with
a view to getting the wave Hamiltonian unambiguously when the coordinates 
are not Cartesian. 

For non-conservative systems the total energy, when it exists, is a func-
tion of time ; so

$$E = \frac{ih}{2\pi}\frac{\partial}{\partial t} \qquad (34)$$

gives the operator to be used for E in such systems. (29) and (34) may
be looked upon as postulates of Schrodinger’s theory. But it is possible to
deduce them from the postulates of Dirac’s theory of states and observables.
We have already shown the connection between Poisson brackets and non-
commutative relations of the type,

$$qp - pq = \frac{ih}{2\pi}. \qquad (35)$$

It is sufficient to remark here that (35) is satisfied by

$$p = -\frac{ih}{2\pi}\frac{\partial}{\partial q}. \qquad (36)$$

For if f(q) is any arbitrary differentiable function of q

$$(qp - pq)\,f(q) = \frac{ih}{2\pi}\,[f(q) + q f'(q)] - \frac{ih}{2\pi}\,q f'(q) = \frac{ih}{2\pi}\,f(q). \qquad (37)$$

The unusual substitutions (29) and (34) are thus connected with the Pois-
son brackets of the classical theory. In Dirac’s theory it is not the substitu-
tions that are fundamental but relations such as (35). Given a function
$\phi(p)$ instead of f(q) we may take

$$q = \frac{ih}{2\pi}\frac{\partial}{\partial p}, \qquad (38)$$

so that $$(qp - pq)\,\phi(p) = \frac{ih}{2\pi}\,\phi(p). \qquad (39)$$

The substitutions of the form (38) are excluded in Schrodinger’s theory but
they are permitted by the more general non-commutative relations of Dirac's
theory. The relation (38) is used to get a wave equation similar to Schrodin-
ger’s for the momentum probabilities of a particle. In addition to the signi-
ficance of differential operators what may be emphasized here is the greater
generality of Dirac’s theory.
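
That a differential operator of the form $p = -(ih/2\pi)\,\partial/\partial q$ satisfies $qp - pq = ih/2\pi$ can be verified mechanically on polynomials. The following Python sketch is only an illustration: it assumes units in which $h/2\pi = 1$, represents a polynomial by its list of coefficients, and uses helper names (`derivative`, `times_q`, `p_op`, `commutator`) invented for the purpose.

```python
HBAR = 1.0  # h / (2 pi) in arbitrary units (an assumption made for this sketch)


def derivative(coeffs):
    """d/dq of the polynomial a0 + a1 q + a2 q^2 + ... given as [a0, a1, a2, ...]."""
    return [k * a for k, a in enumerate(coeffs)][1:] or [0]


def times_q(coeffs):
    """Multiply a polynomial by q (shift every coefficient one place up)."""
    return [0] + list(coeffs)


def p_op(coeffs):
    """Apply the operator p = -(i h / 2 pi) d/dq to a polynomial."""
    return [-1j * HBAR * a for a in derivative(coeffs)]


def commutator(coeffs):
    """Apply (qp - pq) to a polynomial and return the resulting coefficients."""
    qp = times_q(p_op(coeffs))
    pq = p_op(times_q(coeffs))
    length = max(len(qp), len(pq))
    qp = qp + [0] * (length - len(qp))
    pq = pq + [0] * (length - len(pq))
    return [a - b for a, b in zip(qp, pq)]


if __name__ == "__main__":
    f = [2, 0, 3, 1]          # f(q) = 2 + 3 q^2 + q^3
    print(commutator(f))      # each coefficient of f multiplied by i h / 2 pi
```

Exchanging the roles of q and p in the same routine, with the sign of the operator reversed, illustrates the alternative substitution $q = (ih/2\pi)\,\partial/\partial p$.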

It is interesting to see how the Wilson-Sommerfeld conditions are in-
corporated in the theory of Schrodinger’s equation³. The substitution of the
operator as given by (34) extends the scope of the theory beyond the multi-
ply periodic systems of Bohr’s theory. Already we have seen that

$$\log\psi = \frac{2\pi i}{h}\,S - 2\pi i\nu t + \text{const.}, \qquad (40)$$

which is true for any conservative system of particles. Hence one of the
adiabatic invariants for a conditionally periodic system is

$$\oint p\,dq = \oint\frac{\partial S}{\partial q}\,dq = \frac{h}{2\pi i}\oint\frac{\partial}{\partial q}(\log\psi)\,dq;$$

$\log\psi$ is a multiple-valued function so that we get

$$\oint p\,dq = nh, \qquad (41)$$

where n is zero or an integer. The value of n will naturally depend upon
the form of $\psi$ as a function of q.

In the early stages of wave mechanics de Broglie gave an interesting 
explanation of the quantum numbers. If in a string with two ends fixed 
stationary vibrations are set up there is always an integral number of nodes 
on the join of the fixed ends. If n+1 is the number of nodes, d the distance 
between consecutive nodes and D the length of the join, nd = D. On this 
analogy it was argued that an electron in a Bohr orbit represents a stationary 
vibration of n nodes so that

$$\oint\frac{ds}{\lambda} = n.$$

But when we use the condition, $h/\lambda = p = mv$,

$$\oint mv\,ds = nh, \qquad (42)$$

which is equivalent to a quantum condition in Bohr’s theory. In fact this
was the first empirical success of de Broglie’s bold assumption that the two
quality relations, which Einstein had deduced for light, are also true for
material corpuscles. The later experiments on the diffraction of electron
beams made by G. P. Thomson, Rupp and others showed that the quality
relations may be assumed to be valid even for atomic particles participating
in aperiodic phenomena. 

Some remarks may be made regarding the mathematical nature of $\psi$. In
general $\psi$ will be complex as otherwise the interpretation of $\psi$ functions in
terms of observables and probabilities completely fails. Along with the
partial derivatives that appear in Schrodinger’s equations $\psi$ must be single-
valued and finite so that the integral of $|\psi|^2$ over the configuration space is
finite and constant in time. How the last condition affects the complexion
of the problem is most easily seen in the illustration of the harmonic oscillator.
We have for an oscillator,

$$\frac{p^2}{2m} + \tfrac{1}{2}\mu x^2 = E, \qquad (43)$$

giving the Schrodinger equation,

$$\frac{d^2\psi}{dx^2} + \frac{8\pi^2 m}{h^2}\left(E - \tfrac{1}{2}\mu x^2\right)\psi = 0,$$

which may be expressed as

$$\frac{d^2\psi}{dx^2} + (\lambda - \alpha^2 x^2)\,\psi = 0, \qquad (44)$$

where $8\pi^2 mE/h^2 = \lambda$ and $4\pi^2 m\mu/h^2 = \alpha^2$. The only solution which satisfies
the integral condition, n being a non-negative integer, requires

$$\lambda = (2n + 1)\,\alpha, \qquad (45)$$

which means $$E = (n + \tfrac{1}{2})\,h\nu \qquad (46)$$

where $\mu = 4\pi^2\nu^2 m$.

The form of $\psi_n$ is given by

$$\psi_n = N_n\,e^{-\xi^2/2}\,H_n(\xi) \qquad (47)$$

where $N_n$ is a constant, $H_n$ the Hermite polynomial of the nth degree
and $\xi = \sqrt{\alpha}\,x$. The value of the constant is determined usually by the
normalization condition, viz.,

$$\int_{-\infty}^{\infty} |\psi_n|^2\,dx = 1. \qquad (48)$$

When (48) is satisfied $\psi_n$ is said to be normalized. The quantum number
arises in this case through the integral condition which rejects solutions except
those of the form (47). By Bohr’s theory we have in place of (46)

$$nh = \oint p\,dx = \oint p\dot{x}\,dt = \oint 2T\,dt = \oint (T + V)\,dt = E\cdot\frac{1}{\nu}.$$

The energy $\tfrac{1}{2}h\nu$ is called the residual or zero-point energy possessed by atoms
near the absolute zero of temperature. It is customary to reduce a mathe-
matical problem of atoms and molecules at the zero point to one of harmonic
oscillators and then apply the above result of residual energy. This is how
an attempt is made to explain some of the Van der Waals forces. For elec-
tronic distributions the zero-point energy is explained by means of the Fermi-
Dirac statistics. Serious difficulties are still encountered regarding the behavi-
our of matter near the absolute zero ; among the most notorious are the
suprafluidity of helium and the supraconductivity of metals. It is doubtful
if the residual energy can account for them.
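
The oscillator results lend themselves to a numerical check. The Python sketch below builds the functions $N_n e^{-\xi^2/2} H_n(\xi)$ with $\xi = \sqrt{\alpha}\,x$, using the standard recurrence $H_{k+1} = 2\xi H_k - 2kH_{k-1}$ and the standard value $N_n = (\alpha/\pi)^{1/4}/(2^n n!)^{1/2}$; both are well-known results assumed here rather than quoted from the lectures, and the function names and grid parameters are illustrative only. It then tests the normalization integral and the differential equation with $\lambda = (2n+1)\alpha$.

```python
import math


def hermite(n, xi):
    """Hermite polynomial H_n(xi) from H_{k+1} = 2 xi H_k - 2 k H_{k-1}."""
    h_prev, h = 0.0, 1.0
    for k in range(n):
        h_prev, h = h, 2.0 * xi * h - 2.0 * k * h_prev
    return h


def psi(n, x, alpha=1.0):
    """Normalized oscillator function N_n exp(-xi^2/2) H_n(xi), xi = sqrt(alpha) x."""
    xi = math.sqrt(alpha) * x
    norm = (alpha / math.pi) ** 0.25 / math.sqrt(2.0 ** n * math.factorial(n))
    return norm * math.exp(-0.5 * xi * xi) * hermite(n, xi)


def norm_integral(n, alpha=1.0, half_width=12.0, steps=4000):
    """Trapezoidal estimate of the integral of |psi_n|^2 over x."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        x = -half_width + i * dx
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * psi(n, x, alpha) ** 2
    return total * dx


def residual(n, x, alpha=1.0, step=1e-3):
    """psi'' + (lambda - alpha^2 x^2) psi with lambda = (2n + 1) alpha."""
    second = (psi(n, x + step, alpha) - 2.0 * psi(n, x, alpha)
              + psi(n, x - step, alpha)) / step ** 2
    return second + ((2 * n + 1) * alpha - alpha ** 2 * x ** 2) * psi(n, x, alpha)


if __name__ == "__main__":
    for n in range(4):
        print(n, norm_integral(n), residual(n, 0.7))
```

Replacing $(2n+1)\alpha$ by any other value of $\lambda$ in `residual` leaves a remainder that does not vanish, which is the numerical counterpart of the statement that only the values (45) are admitted.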

When the reader is introduced to the uncertainty principle it will be
understood that, even in the state of lowest energy, an oscillating particle can-
not be at once at rest and at the origin as that would violate the principle.
Hence, even in the lowest state, the uncertainty of position and velocity is
responsible for some energy which is the zero-point energy. In the theory
of radiation the analogy is provided by the transverse self-energy of the
electron in free space.


3. THE TWO INTERPRETATIONS OF $\psi$

Schrodinger’s first theory² of $\psi$ is known as the hydrodynamical theory.
Consider an electron of mass m and charge $-e$ moving in an external field
of potential V so that $\psi$ satisfies the equation,

$$\frac{h^2}{8\pi^2 m}\nabla^2\psi + eV\psi - \frac{h}{2\pi i}\frac{\partial\psi}{\partial t} = 0. \qquad (49)$$

It is usual to denote the conjugate complex function obtained by changing
the sign of i in $\psi$ as $\psi^*$. Evidently $\psi^*$ satisfies the equation,

$$\frac{h^2}{8\pi^2 m}\nabla^2\psi^* + eV\psi^* + \frac{h}{2\pi i}\frac{\partial\psi^*}{\partial t} = 0, \qquad (50)$$

which is implicit in (49) and which is obtained therefrom by changing the
sign of i throughout. The charge density $\rho$, the current density I and the
energy density u are defined by

$$\rho = -e\,\psi^*\psi, \qquad (51a)$$

$$I = -\frac{eh}{4\pi i m}\,(\psi^*\,\mathrm{grad}\,\psi - \psi\,\mathrm{grad}\,\psi^*), \qquad (51b)$$

$$u = \frac{h^2}{8\pi^2 m}\,\mathrm{grad}\,\psi^*\cdot\mathrm{grad}\,\psi. \qquad (51c)$$

With these definitions the laws of conservation of charge, momentum and
energy follow in the form,

$$\frac{d}{dt}\int\rho\,d\tau = 0, \qquad (52a)$$

$$\frac{d}{dt}\int I\,d\tau = -\frac{e^2}{m}\int\psi^*\psi\;\mathrm{grad}\,V\,d\tau, \qquad (52b)$$

$$\frac{d}{dt}\int u\,d\tau = e\int V\,\frac{\partial}{\partial t}(\psi^*\psi)\,d\tau, \qquad (52c)$$

where $d\tau$ is the volume element and the integrations are carried over all
space. To deduce (52a) and (52c) multiply (49) by $\psi^*$ and (50) by $\psi$ ;
adding and subtracting the two gives the results when Green's theorem is
applied. For (52b) multiply (49) by $\partial\psi^*/\partial x$, differentiate (50) with respect
to x and then multiply it by $\psi$ ; subtract one from the other and integrate
the result using Green’s theorem. The normalisation condition of $\psi$ makes
it necessary that $\psi$ should vanish at infinity. This fact has to be used in
each of the above deductions.

The above interpretation of $\psi$ was responsible for a wrong formula for
the intensity of emitted radiation. In the case of an atom as a conserva-
tive system the wave equation is

$$H\psi = E\psi \qquad (53)$$

and, corresponding to the different permissible functions $\psi_n$ which satisfy the
equation, there are found to be discrete values of energy $E_n$. Thus $E_n$ is
the eigenvalue or characteristic value of E corresponding to the characteristic
state $\psi_n$. If the reader is puzzled as to how these discrete values of energy
and the discrete functions arise the equation (53) may be likened to that
of an ellipse. Corresponding to each direction there is a definite value of
the central radius vector just as there is a value of E corresponding to
each $\psi$. But only two mutually perpendicular directions, viz., those of the
principal axes and the corresponding values of the vectors have a special
significance inasmuch as the whole ellipse can be constructed with a know-
ledge of them. Similarly, of physical significance are the eigenfunctions $\psi_n$
and the values $E_n$ which give the full content of (53). Incidentally, it may
be remarked that the eigenvalues may form a continuous range. As in the
case of the harmonic oscillator both $\psi_n$ and $E_n$ follow from the integral condi-
tion on $\psi$.

Now a large assembly of atoms⁵ is described by

$$\psi = \sum_r c_r\psi_r \qquad (54)$$

where $|c_r|^2$ is the number of atoms in the state $\psi_r$. We may express $\psi_r$ in
the form

$$\psi_r = \psi_r^0(x, y, z)\,e^{-2\pi i E_r t/h}. \qquad (55)$$

For on substituting for $\psi_r$ in

$$H\psi = -\frac{h}{2\pi i}\frac{\partial\psi}{\partial t} \qquad (56)$$

we get

$$H\psi_r = E_r\psi_r. \qquad (57)$$

Hence

$$\psi^*\psi = \sum_r\sum_s c_r c_s^*\,\psi_r^0\,\psi_s^{0*}\,e^{2\pi i(E_s - E_r)t/h}, \qquad (58)$$

where $c_s^*$ is the conjugate complex of $c_s$. An absorption or emission between
the levels $E_r$ and $E_s$ thus depends upon the number of atoms in both the
states according to the first theory of $\psi$. This result contradicts experience.
Even according to Bohr’s theory the intensity of emitted radiation depends
upon the number of atoms in the higher level only. The suggestion that the
intensity of radiation can be determined from a formula such as (58) has
failed.

Another failure³ of this interpretation of $\psi$ was occasioned by the sup-
position that electrons are charge clouds. If we assume that the different
parts of a charge cloud repel each other we get wrong results definitely. More-
over, charge clouds will not behave as corpuscles in collision experiments. In
a problem of two particles the charge cloud must be supposed to be spread
over a space of six dimensions. These difficulties were enough to discredit
the first interpretation of $\psi$ although it gave a satisfactory explanation of
the stationary states through those terms of (58) which do not change with
time. It was Born who supplied the second interpretation and prepared the
ground for Heisenberg’s uncertainty principle.

According to Born, if $\psi$ is the Schrodinger function for a system of n
coordinates $q_1, q_2, \ldots, q_n$,

$$\psi^*\psi\;dq_1\,dq_2 \ldots dq_n \qquad (59)$$

is the probability that, at time t, the system is in a configuration bound by
$q_r$ and $q_r + dq_r$ for r = 1, 2, ..., n. The first successful application of this
interpretation was made for deducing the scattering formula for $\alpha$ particles
obtained by Rutherford from classical principles. A simple example may
be taken now introducing the new meaning of $\psi$.

For a particle moving along Ox in free space the wave equation runs as
follows :

$$\frac{h^2}{8\pi^2 m}\frac{\partial^2\psi}{\partial x^2} = \frac{h}{2\pi i}\frac{\partial\psi}{\partial t}. \qquad (60)$$

A solution is $$\psi = A\,e^{2\pi i(\tau x - \nu t)} \qquad (61)$$

where A is a constant and

$$-\frac{h^2}{2m}\,\tau^2 + h\nu = 0, \qquad (62)$$

which is obvious when expressed in the usual notation as

$$E = \frac{p^2}{2m}. \qquad (62')$$

A particle of momentum p and energy E is expressed by the wave function
(61). There is no such thing as a definite position associated with it. A
particle whose dynamical qualities p and E are precisely defined is indefinite
as regards its kinematical qualities inasmuch as they are represented by the
infinite wave. An echo of Zeno’s paradox of the arrow comes to the mind
as one notices that velocity or momentum at an instant cannot be correlated
with position.


Consider¹ now a wave group defined by the equation,

$$\psi(x, t) = \int_{-\infty}^{\infty} a(\tau)\,e^{2\pi i(\tau x - \nu t)}\,d\tau \qquad (63)$$

which satisfies (60), $\tau$ and $\nu$ being connected by (62). Let

$$a(\tau) = e^{-(\tau - \tau_0)^2/q^2} \qquad (64)$$

so that $\tau_0$ is the value corresponding to the maximum of $a(\tau)$. 2q
defines the length of the symmetrical zone about the line of symmetry so that
the ordinate falls here from the maximum to $e^{-1}$ of it. q is taken as the mea-
sure of precision. The smaller the value of q the greater is the precision. Put

$$\tau = \tau_0 + q\xi \qquad (65)$$

and

$$\nu = \frac{h\tau^2}{2m} = \nu_0 + \frac{h}{2m}\,(2\tau_0 q\xi + q^2\xi^2). \qquad (66)$$

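Hence

$$\psi(x, t) = q\,e^{2\pi i(\tau_0 x - \nu_0 t)}\int_{-\infty}^{\infty}\exp\left\{-\xi^2 + 2\pi i\left[q\xi\,(x - v_0 t) - \frac{h q^2\xi^2 t}{2m}\right]\right\}d\xi = \frac{q\sqrt{\pi}}{\sqrt{\beta}}\,e^{2\pi i(\tau_0 x - \nu_0 t)}\,e^{-\eta^2/\beta} \qquad (67)$$

where

$$\beta = 1 + \frac{\pi i h q^2 t}{m}, \qquad \eta = \pi q\,(x - v_0 t), \qquad v_0 = \frac{h\tau_0}{m}. \qquad (68)$$

Here $v_0$ is the velocity corresponding to the maximum coordinate of the wave
packet. It is the average velocity associated with the particle. The probabi-
lity distribution is given by

$$\psi^*\psi = \frac{\pi q^2}{\{1 + (\pi h q^2 t/m)^2\}^{1/2}}\,\exp\left\{\frac{-2\pi^2 q^2\,(x - v_0 t)^2}{1 + (\pi h q^2 t/m)^2}\right\}. \qquad (69)$$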


The precision parameter of the distribution (69) now becomes

$$\frac{1}{\sqrt{2}\,\pi q}\left\{1 + \left(\frac{\pi h q^2 t}{m}\right)^2\right\}^{1/2}. \qquad (70)$$

As the parameter increases in value the wave group becomes more and more
diffused. A particle moving with an average velocity $v_0$ in free space can
be represented in wave mechanics by a Gaussian curve which spreads with
time so that the maximum moves uniformly with velocity $v_0$. It is obvious
from (65) and (66) why we take $h\tau_0/m = v_0$ for the average velocity of
the particle, which is by definition the average value of $d\nu/d\tau$.
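
The spreading of the group can be followed numerically. The Python sketch below assumes that the probability distribution of the Gaussian group has the form $s^{-1}\exp\{-2\pi^2q^2(x - v_0t)^2/s^2\}$ with $s = \{1 + (\pi hq^2t/m)^2\}^{1/2}$, apart from a constant factor; the function names, the rounded constants and the sample values of q, $v_0$ and t are illustrative assumptions.

```python
import math

H = 6.626e-34    # Planck's constant in J s (rounded value, an assumption)
M_E = 9.109e-31  # electron rest mass in kg (rounded value, an assumption)


def spread_factor(q, t, mass=M_E):
    """Growth factor s = {1 + (pi h q^2 t / m)^2}^(1/2) of the group width."""
    return math.sqrt(1.0 + (math.pi * H * q * q * t / mass) ** 2)


def density(x, t, q, v0, mass=M_E):
    """Probability distribution of the group, omitting the constant factor pi q^2."""
    s = spread_factor(q, t, mass)
    return math.exp(-2.0 * (math.pi * q * (x - v0 * t)) ** 2 / s ** 2) / s


if __name__ == "__main__":
    q = 1.0e9    # spread of wave numbers in 1/m (illustrative value)
    v0 = 1.0e6   # velocity of the maximum in m/s (illustrative value)
    for t in (-2e-15, -1e-15, 0.0, 1e-15, 2e-15):
        # The factor is the same for +t and -t: the group is sharpest at t = 0.
        print(f"t = {t:+.1e} s   spread factor = {spread_factor(q, t):.3f}")
```

Because the time enters only through its square, the width is the same at times t and −t, so that the group is less sharply defined both before and after the instant t = 0.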

Three conclusions are suggested by the above calculations. Firstly a
solitary wave like (61) describes a possible state in which a particle can exist
but tells nothing of the configuration. For the configuration we have to take
a wave group like (63) and see how it is controlled by the wave equation. It
gives us a more diffuse configuration with the passage of time so that the
uncertainty associated with the average value increases. Incidentally, it is
rather unfortunate that q should be called the precision parameter since an
increase of it means an increase of uncertainty. But what is equally import-
ant is that if we know the configuration at t = 0 the uncertainty increases
not only for t > 0 but for t < 0 as well : this is the second point we should
emphasize drawing attention to the form (70). Thirdly, configurations neces-
sarily imply wave groups and hence with the same particle, in a particular
configuration, different values of energy and momentum are associated through
the concept of probability. How this is done may be explained with the
help of some fundamental results.

Let $\psi_r$ be the different possible normalized eigen-$\psi$'s of Schrodinger's
equation. Then

$$\int\psi_r^*\,\psi_r\,d\tau = 1 \qquad (71a)$$

and

$$\int\psi_r^*\,\psi_s\,d\tau = 0, \qquad (71b)$$

the element $d\tau$ standing for $dq_1 \ldots dq_n$ when $q_1, q_2, \ldots, q_n$ are the co-ordinates ;
the integrations are carried over the whole q-space. The first result
is merely a restatement of the normalization condition. The second
implies the orthogonality of wave functions which is true if $\psi_r$ and $\psi_s$ corres-
pond to different values of energy. For

$$H\psi_r^* = E_r\psi_r^*,$$

$$H\psi_s = E_s\psi_s.$$

Hence

$$\psi_r^*\,H\psi_s - \psi_s\,H\psi_r^* = (E_s - E_r)\,\psi_r^*\psi_s. \qquad (72)$$

The integration gives, on using the operational form of H and Green’s theorem,

$$(E_s - E_r)\int\psi_r^*\,\psi_s\,d\tau = 0. \qquad (73)$$

A more elegant proof is given on the basis of Dirac's theory. The ortho-
gonality condition does not necessarily hold good if $E_r = E_s$, that is, if
$\psi_r$ and $\psi_s$ are eigenfunctions of a degenerate state.

In general it is possible to express the eigen-$\psi$ of a system by

$$\psi = \sum_r c_r\psi_r \qquad (74)$$

or by an integral. $c_1$, $c_2$ etc. are in general complex numbers. If all
the $\psi$'s are normalized

$$\sum_r |c_r|^2 = 1. \qquad (75)$$

We now say that if the system is in the state $\psi$ the probability that the energy
of the system is $E_r$ corresponding to $\psi_r$ is $|c_r|^2$. Similarly, in the particle
problem discussed above the resolution of $\psi(x, t)$ into wave functions of
the form (61) gives the probabilities of different momentum values.
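
The rule connecting the coefficients with probabilities may be illustrated by a short calculation. The Python sketch below assumes orthogonal, normalized eigenfunctions so that only the coefficients matter; the helper names and the sample coefficients and energies are hypothetical, chosen merely to show the arithmetic.

```python
def normalize(coeffs):
    """Scale the complex coefficients c_r so that the sum of |c_r|^2 is unity."""
    total = sum(abs(c) ** 2 for c in coeffs)
    return [c / total ** 0.5 for c in coeffs]


def energy_probabilities(coeffs):
    """Probability |c_r|^2 of finding each energy E_r, after normalization."""
    return [abs(c) ** 2 for c in normalize(coeffs)]


def mean_energy(coeffs, energies):
    """Average energy, the sum of |c_r|^2 E_r over all states."""
    return sum(p * e for p, e in zip(energy_probabilities(coeffs), energies))


if __name__ == "__main__":
    coeffs = [1, 1j, 1 - 1j]      # unnormalized complex coefficients
    energies = [0.5, 1.5, 2.5]    # oscillator levels (n + 1/2) in units of h nu
    print(energy_probabilities(coeffs))
    print(mean_energy(coeffs, energies))
```

Only the moduli of the coefficients enter the probabilities; their phases matter when the probability of some other observable, such as position, is required.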

Are the probability waves 8 real ? No ! They are just a logical fiction and 
their purpose is to explain the observed atomic phenomena. In a general 
problem the waves are multi-dimensional ; the wave packets usually disinte- 
grate rapidly ; an actual discovery of a particle makes a wave-packet sud- 
denly shrink in size. These are characteristic properties of wave packets 
which show that they belong essentially to the logical structure underlying the 
phenomena. 

In classical mechanics a problem is solved when the coordinates are ob- 
tained as functions of time. In the new theory of wave mechanics the solu-
tion must answer adequately all possible questions of the form : if a dyna-
mical observable has a value lying between u and u + du what is the proba-
bility that another observable has a value lying between v and v + dv ? The
Schrodinger theory, as modified by Born’s interpretation, did not provide
answers to all such questions. The theory of $\psi$ answered only the question :
if the energy has a particular value what is the probability that the position
configuration stands in a certain region of the coordinate space ? The momen-
tum configuration could not be deduced from the $\psi$ function. All these ques-
tions were later successfully attacked by Dirac’s transformation theory.

The concept of probability has been most successfully applied to problems
of collision and aperiodic phenomena in general. But at the stage
to which the atomic theory had been brought in 1927 the phenomena of radiation
were not satisfactorily explained. The relation between photons and waves
and that between electromagnetic waves and probability were misunderstood
or remained a mystery. A proper theory of the coefficient of spontaneous
emission and of the electron spin was not yet deduced from the postulates and
physical principles of the theory.

Schrodinger's theory in terms of probabilities was a definite advance over
Bohr's theory. It gave a logical explanation of the three quantum numbers
n, l, m. A successful derivation of the scattering formulae was also provided
by the postulates of the theory. Although it did not explain the dual nature
of light a new line of thought was initiated, independent of mechanical models
and concerned with observed effects only. The new phenomenological outlook
must be distinguished from the Aristotelian philosophy: the former proceeds
from the phenomena to the logical correlation, the latter from the postulates
to the phenomena. The change in attitude has been responsible for a new
concept of particles which is quite different from the classical. It is not
surprising therefore that the classical concept of causality finds no place in the
scheme of new particles.

4. SOME COLLISION PHENOMENA

We will first describe certain phenomena and then try to explain them
by the method of the Fourier expansion and the probability postulate.

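[Figure: potential barrier along Ox, with the energy levels $E_1$, $E_2$, $E_3$, $E_4$ and the points A to H referred to below.]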



Consider an atomic particle moving in free space in a direction parallel 
to Ox. Suppose that a potential barrier is set up on the way as in the figure. 
The behaviour of the particle as a result of the barrier may first be described 
on the classical theory. Since 

$$T + V = E$$

all motion for which $E < V$ is impossible, for the kinetic energy cannot be
negative. There are four cases to be discussed:

(i) A particle moving with total energy $E_1$ is undisturbed by the barrier.

(ii) If the total energy is $E_2$ a particle can move towards the barrier
from either side. The subsequent motion will be influenced by the
nature of impact at A or B. In any case a particle will not be able
to jump over the peak.

(iii) When the total energy is $E_3$ motion is possible in DE, to the right
of F, or to the left of C. But no particle moving in any one of
these zones can migrate to another.

(iv) Only in the zone GH is motion possible with total energy $E_4$.

These four situations are often envisaged in atomic physics. We will now see
in what way a particle of wave mechanics behaves in similar situations.

(i') If the $E_1$ level is not very much above the potential peak and if the
potential disturbance extends over a few de Broglie wave lengths
the particle may be reflected back. When cathode rays are passed
through a parallel-plate condenser it is discovered that some of the
electrons are reflected back when the plates are brought sufficiently
near together.

(ii') If the $E_2$ level is not very much below the peak the so-called "tunnel
effect" is witnessed: the particle moving on the left has a
finite probability of appearing on the right and vice versa. This
effect is particularly marked in nuclear phenomena. Gamow's
original theory of $\alpha$ decay was based on this effect. The equation
of transmutation,

$$^{7}_{3}\mathrm{Li} + {}^{1}_{1}\mathrm{H} = {}^{4}_{2}\mathrm{He} + {}^{4}_{2}\mathrm{He},$$

implies that a proton can penetrate through the barrier of repelling
charges in the lithium nucleus and split it into two $\alpha$ particles. The
experiments in which surface electrons are pulled out of metals by
applying a uniform electric field outside provide a similar situation.

(iii') A particle coming from the left of C with energy $E_3$ has a finite
chance of going into a state of periodic motion in the region DE
provided CD is equal to an integral number of de Broglie wave
lengths. Similarly, if EF is also equal to an integral number of
de Broglie wave lengths, the particle may appear on the right of F.
This requirement is partially fulfilled in the theory of Ramsauer's
scattering experiments on inert gases and in Oppenheimer's
experiments on hydrogen atoms in an electric field.

(iv') According to Dirac's theory the positron makes its appearance
whenever an electron in one of the negative energy levels
ascends to a level of positive energy on account of interaction with
radiation or otherwise. In the transmutation,

$$^{30}_{15}\mathrm{P} = {}^{30}_{14}\mathrm{Si} + e^{+},$$

an electron appears in the P nucleus from one of the negative-energy
levels. This is similar to an electron of energy $E_4$ entering from
outside into the region GH.
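
The tunnel effect of case (ii') can be given a number in the simplest idealisation. The sketch below assumes a rectangular barrier of height `V0` and width `a`, which is not the barrier of the figure, and uses the standard transmission formula for that shape; the function name and the units (electron-volts and angstroms, for an electron) are likewise assumptions made for illustration.

```python
import math

# hbar^2 / (2 m) for an electron, in eV * angstrom^2 (assumed units)
HBAR2_OVER_2M = 3.80998

def transmission(E, V0, a):
    """Probability of passing a rectangular barrier (height V0, width a) for 0 < E < V0.

    T = 1 / (1 + V0^2 sinh^2(kappa a) / (4 E (V0 - E))),
    with kappa = sqrt(2 m (V0 - E)) / hbar.
    """
    if not 0.0 < E < V0:
        raise ValueError("this formula is meant for 0 < E < V0")
    kappa = math.sqrt((V0 - E) / HBAR2_OVER_2M)   # in 1/angstrom
    s = math.sinh(kappa * a)
    return 1.0 / (1.0 + V0 ** 2 * s ** 2 / (4.0 * E * (V0 - E)))

for width in (0.5, 1.0, 2.0, 4.0):
    print("width", width, "angstrom: T =", transmission(5.0, 10.0, width))
```

The probability is finite although classically the motion is forbidden, and it falls off rapidly as the barrier is widened or as the level sinks further below the peak.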

The explanation of all these phenomena lies in the possible Fourier expansions
and the probability concept. By the well-known integral theorem
of Fourier, a function f(x) with a finite number of discontinuities and with
its behaviour at infinity suitably controlled can be expressed as

$$f(x) = \int_{-\infty}^{\infty} A_\tau\, e^{2\pi i \tau x}\, d\tau$$

where

$$A_\tau = \int_{-\infty}^{\infty} f(x)\, e^{-2\pi i \tau x}\, dx.$$

Consider as a particular case

$$f(x) = e^{2\pi i \tau_0 x} \ \text{for}\ |x| < \tfrac{1}{2}\Delta x, \qquad f(x) = 0 \ \text{for}\ |x| > \tfrac{1}{2}\Delta x,$$

so that

$$A_\tau = \int_{-\frac{1}{2}\Delta x}^{\frac{1}{2}\Delta x} e^{2\pi i x(\tau_0 - \tau)}\, dx = \frac{\sin \pi(\tau_0 - \tau)\Delta x}{\pi(\tau_0 - \tau)}.$$

$A_\tau$, as a function of $\tau$, has an infinite number of maxima. The principal
maximum corresponds to $\tau = \tau_0$ giving the value $\Delta x$. When

$$(\tau_0 - \tau)\,\Delta x = \pm 1$$

$A_\tau$ becomes zero and its value outside this zone of $\tau$ may be neglected as
small. The smaller the value of $\Delta x$ the greater becomes the significant zone
about $\tau_0$. The usual diffraction formula,

$$d \sin\phi = n\lambda,$$

is based on a similar mathematical result. The smaller the breadth of the
slit d, the greater is the significant extension of the diffraction pattern as
measured by $\sin\phi$. Now an atomic particle moving with a definite energy
and momentum in free space corresponds to a wave,

$$\psi = e^{2\pi i(\tau x - \nu t)}.$$

The effect of a disturbance is to limit in some way the unlimited expansion of
the wave. The partial expansion which is restricted by the disturbance can
be expanded by the Fourier theorem into waves which correspond to different
values of $\tau$ and $\nu$ and hence of energy and momentum. The probabilities of
these new states are given by the coefficients in the expansion according to
(74) and (75). Every disturbance offered to an atomic particle increases its
capacity to overcome it by providing new potentialities of state. If the
disturbance is not sufficiently small calculations show that the effects described
above do not occur.
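
The amplitude of the truncated wave discussed above can be checked by direct quadrature. This is a sketch only: the values of $\tau_0$ and $\Delta x$ are arbitrary and the function names are hypothetical.

```python
import cmath
import math

def amplitude_numeric(tau, tau0, dx, n=4000):
    """Midpoint-rule value of the integral of exp(2 pi i x (tau0 - tau)) over |x| < dx/2."""
    h = dx / n
    total = 0j
    for k in range(n):
        x = -0.5 * dx + (k + 0.5) * h
        total += cmath.exp(2j * math.pi * x * (tau0 - tau))
    return total * h

def amplitude_closed(tau, tau0, dx):
    """sin(pi (tau0 - tau) dx) / (pi (tau0 - tau)), with the limit dx at tau = tau0."""
    d = tau0 - tau
    if d == 0.0:
        return dx
    return math.sin(math.pi * d * dx) / (math.pi * d)

tau0 = 3.0
for dx in (2.0, 1.0, 0.5):
    first_zero = tau0 + 1.0 / dx   # where (tau0 - tau) dx = -1
    print("dx =", dx,
          " peak =", amplitude_closed(tau0, tau0, dx),
          " |A| at first zero =", abs(amplitude_numeric(first_zero, tau0, dx)),
          " half-width of significant zone =", 1.0 / dx)
```

Halving $\Delta x$ doubles the breadth of the zone of $\tau$ in which the amplitude is appreciable, which is the reciprocal relation used in the argument.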

It is necessary to prove that the probability theory of waves leads to the
laws of classical mechanics at the common frontier of microscopic and macroscopic
domains. For the other interpretation of $\psi$ it has been already shown
how the wave form reduces to the action function satisfying the Hamilton-Jacobi
equation. Following Ehrenfest we take the simpler case of one particle
moving along Ox under a force $-\partial V/\partial x$, V being the potential. The centre
of gravity of the probability distribution is

$$X = \frac{\int \psi^{*}\, x\, \psi\, dx}{\int \psi^{*}\, \psi\, dx}$$

and if $\psi$ is normalized

$$X = \int \psi^{*}\, x\, \psi\, dx.$$

Using the wave equation

$$\frac{h^2}{8\pi^2 m}\frac{\partial^2 \psi}{\partial x^2} - V\psi + \frac{ih}{2\pi}\frac{\partial \psi}{\partial t} = 0$$

we find

$$m\ddot{X} = -\int \psi^{*}\, \frac{\partial V}{\partial x}\, \psi\, dx.$$

The right-hand side gives the average value of $-\partial V/\partial x$ for the distribution.
Hence the Newtonian equation

$$m\ddot{X} = -\partial V/\partial X$$

follows. Thus the most probable distributions follow the classical laws. In
the problem of a particle in free space we saw that the most probable position
changes with time according to Newton's first law.

In the derivation of the strange results described in the beginning of this
section where a de Broglie wave is partially reflected and partially transmitted
the boundary conditions satisfied at the surface of disturbance are the
classical ones, viz., (i) the continuity of $\psi$ and (ii) the continuity of the
normal derivative of $\psi$. These additional boundary conditions formally
complete the theory of aperiodic phenomena to the limited extent of Schrodinger's
method.

5. THE KEPLER PROBLEM

The most important application of Schrodinger’s theory has been made 
to the hydrogen-like atom with a nuclear charge Ze and only one satellite electron.
This is known as the Kepler problem on account of its resemblance to
the problem of planetary motion. The electronic levels are determined in any 
atom, containing n electrons, by reducing the problem to the simpler one of a 
nucleus and one electron. Fortunately such a reduction is possible within the 
limits of approximation ; otherwise the description of electronic motion in an 
atom would be very complicated. The theory of the periodic table and the 
application of the Pauli exclusion principle are based upon the assumption 
that, to the first degree of approximation, every electron orbit can be calculated 
in terms of the Kepler problem. The Schrodinger equation in this case runs 
as follows : 

$$\nabla^2 \psi + \frac{8\pi^2 m}{h^2}\left(E + \frac{Ze^2}{r}\right)\psi = 0. \qquad (76)$$

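This is obtained on the assumption that the motion of the nucleus is negligible.
We are interested in $\psi$-functions of the form $R(r)S(\theta)T(\phi)$ and
hence we write the differential equation in polar coordinates,

$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial \psi}{\partial r}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2 \psi}{\partial \phi^2} + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial \theta}\left(\sin\theta\frac{\partial \psi}{\partial \theta}\right) + \frac{8\pi^2 m}{h^2}\left(E + \frac{Ze^2}{r}\right)\psi = 0. \qquad (76')$$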


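The equation splits up into the following three:

$$\frac{d^2 T}{d\phi^2} = \alpha T, \qquad (77)$$

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{dS}{d\theta}\right) + \left(\beta + \frac{\alpha}{\sin^2\theta}\right)S = 0, \qquad (78)$$

$$\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \left[\frac{8\pi^2 m}{h^2}\left(E + \frac{Ze^2}{r}\right) - \frac{\beta}{r^2}\right]R = 0. \qquad (79)$$
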
To know how the splitting arises multiply (76') by $r^2\sin^2\theta$ and divide by $\psi$.
Then we notice that the equation assumes the form

$$f(r, \theta) - f_1(\phi) = 0. \qquad (80)$$

As r, $\theta$, $\phi$ are independent coordinates (80) is equivalent to

$$f_1(\phi) = \alpha, \qquad f(r, \theta) = \alpha,$$

where $\alpha$ is some constant. Similarly the equation $f(r, \theta) = \alpha$ splits up into
two and the other constant $\beta$ is introduced. Since $\psi$ must be single-valued
and vanishing at infinity it is necessary that $\alpha = -m^2$, m being zero or an
integer. Solving (77) we get the normalized functions,

$$T_m(\phi) = \frac{1}{\sqrt{2\pi}}\, e^{im\phi}, \qquad (81)$$

$$\int_0^{2\pi} T_m^{*}\, T_m\, d\phi = 1.$$

For $m \neq n$,

$$\int_0^{2\pi} T_m^{*}\, T_n\, d\phi = 0.$$


m is the magnetic quantum number. The azimuthal quantum number l is
introduced in the definition of S. The restriction on the nature of $\psi$ permits
only a polynomial in $\cos\theta$ and $\sin\theta$ as the solution of S. We get the
associated Legendre polynomials,

$$S_{l|m|}(\theta) = \left[\frac{2l+1}{2}\,\frac{(l-|m|)!}{(l+|m|)!}\right]^{1/2} P_l^{|m|}(\cos\theta), \qquad (82)$$

where

$$\beta = l(l+1), \qquad (83)$$

and l is zero or a positive integer not less than |m|. The numerical factor of (82)
is introduced so that the normalization condition

$$\int_0^{\pi} \left[S_{l|m|}(\theta)\right]^2 \sin\theta\, d\theta = 1$$

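is satisfied. It may be verified that

$$\int_0^{\pi} S_{l|m|}(\theta)\, S_{l'|m|}(\theta)\, \sin\theta\, d\theta = 0 \qquad (l \neq l').$$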


Incidentally, we observe that between the azimuthal quantum number l and
the magnetic quantum number m is the relation,

$$l \geq |m|. \qquad (84)$$

If (84) is violated the S-functions do not exist. Similar considerations of (79)
give the only possible solution

$$R_{nl}(r) = -\left[\left(\frac{8\pi^2 m Z e^2}{n h^2}\right)^{3} \frac{(n-l-1)!}{2n\{(n+l)!\}^{3}}\right]^{1/2} e^{-\rho/2}\, \rho^{l}\, L_{n+l}^{2l+1}(\rho), \qquad (85)$$

where

$$\rho = \frac{8\pi^2 m Z e^2}{n h^2}\, r.$$

The L-functions are the associated Laguerre polynomials. The integer n is
such that

$$n \geq (l + 1)$$

and it is called the principal quantum number. The $R_{nl}$ functions are all
normalized. The characteristic $\psi$-functions are now given by

$$\psi_{nlm} = R_{nl}(r)\, S_{l|m|}(\theta)\, T_m(\phi). \qquad (86)$$

They are orthogonal and normalized on account of

$$\int_0^{\infty}\!\int_0^{\pi}\!\int_0^{2\pi} |\psi_{nlm}|^2\, r^2 \sin\theta\, d\theta\, d\phi\, dr = 1, \qquad (87)$$

and

$$\int_0^{\infty}\!\int_0^{\pi}\!\int_0^{2\pi} \psi_{nlm}^{*}\, \psi_{n'l'm'}\, r^2 \sin\theta\, d\theta\, d\phi\, dr = 0, \qquad (88)$$

where n l m and n' l' m' are not both the same set of quantum numbers.

The energy constant assumes the eigenvalues,

$$E_n = -\frac{2\pi^2 m Z^2 e^4}{n^2 h^2}. \qquad (89)$$

These are precisely the values obtained by Bohr using the quantum conditions.
In wave mechanics the eigenvalues follow from the restrictions on the $\psi$-function
of Schrodinger's equation.
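
The magnitude of the levels (89) is easily evaluated. In the sketch below the numerical constants are modern values in c.g.s. units, inserted as assumptions since the text quotes none, and the function name is hypothetical.

```python
import math

# Assumed physical constants in c.g.s. units (not quoted in the text)
M_ELECTRON = 9.1093837e-28     # g
E_CHARGE = 4.80320471e-10      # e.s.u.
H_PLANCK = 6.62607015e-27      # erg s
ERG_PER_EV = 1.602176634e-12

def energy_ev(n, Z=1):
    """E_n = -2 pi^2 m Z^2 e^4 / (n^2 h^2), converted from ergs to electron-volts."""
    e_erg = -2.0 * math.pi ** 2 * M_ELECTRON * Z ** 2 * E_CHARGE ** 4 / (n ** 2 * H_PLANCK ** 2)
    return e_erg / ERG_PER_EV

for n in (1, 2, 3):
    print("n =", n, " E_n =", round(energy_ev(n), 3), "eV")
```

For Z = 1 the lowest level comes out close to the familiar ionization energy of hydrogen, and the levels scale as $Z^2/n^2$.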

The electronic levels for which l = 0, 1, 2, 3, 4 are called respectively
s, p, d, f and g levels. The first four letters were suggested by the sharp,
principal, diffuse and fundamental lines of the hydrogen spectra. l = 4 is
called a g level because it follows the f level. The electrons for which n = 1
constitute the K shell. The higher values of n give the successive L, M, N, O
etc. shells. Corresponding to a given value of l there are (2l + 1) values
of m given by (84). If the principal quantum number is n the different
pairs of l-m values are in number,

$$\sum_{l=0}^{n-1} (2l+1) = n^2. \qquad (90)$$

There is a fourth quantum number for an electron: it is the spin s which has
the possible values $\pm\frac{1}{2}$. Hence in the nth shell the maximum number of
electrons that can be packed is $2n^2$ if two or more electrons are not to have the
same set of quantum numbers n l m s. We are using here the exclusion
principle in giving each electron a separate wave function $\psi_{nlms}$. Thus in
the K shell there can be only 2 electrons, in the L shell only 8, in the M shell
only 18 and so on.
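
The counting that leads to (90) and to the shell capacities can be carried out by plain enumeration. The sketch below is illustrative only and the function name is hypothetical.

```python
def shell_states(n):
    """All sets (n, l, m, s) allowed by l < n, |m| <= l and s = +1/2 or -1/2."""
    return [(n, l, m, s)
            for l in range(n)
            for m in range(-l, l + 1)
            for s in (-0.5, 0.5)]

for n, name in enumerate("KLMN", start=1):
    pairs = sum(2 * l + 1 for l in range(n))   # number of l-m pairs, as in (90)
    print(name, "shell: l-m pairs =", pairs, ", electrons =", len(shell_states(n)))
```

Every set of four quantum numbers appears once only in the list, which is the content of the exclusion principle as used here.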

The connection between the energy constant and the quantum numbers
merits attention. If we construct the Schrodinger equation from the classical
Hamiltonian we get (89) in which case the energy depends only on the principal
quantum number. If we take the relativistic Hamiltonian and solve
the wave equation resulting therefrom the energy is found to be a function
of both n and l. This is in good agreement with Sommerfeld's classical theory
of fine structure. In a magnetic field the energy is influenced by the quantum
numbers m and s also. Moreover the constant Z varies from atom to atom
so that an electron in the n l m s level in one atom has a different energy from
the electron in the same level in another atom, that is, one for which Z is
different. In Bohr's theory, the adiabatic principle had to be invoked to
justify the use of the same quantum numbers for different values of Z. No
such difficulty arises in wave mechanics because of the manner in which the
quantum numbers are introduced in the theory.

A physical interpretation of the quantum numbers l and m is provided
by the differential operators of the moments. The angular momentum of an
electron about the z-axis is given by

$$H_z = x m\dot{y} - y m\dot{x} = x p_y - y p_x = \frac{h}{2\pi i}\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) = \frac{h}{2\pi i}\frac{\partial}{\partial \phi}. \qquad (91)$$

When $H_z$ operates on $\psi_{nlm}$ we have from the form of $T_m$

$$H_z \psi_{nlm} = \frac{h}{2\pi i}\,(im)\,\psi_{nlm} = \frac{mh}{2\pi}\,\psi_{nlm}. \qquad (92)$$

The eigenvalues of the angular-momentum operator $H_z$ are $hm/2\pi$. The
resultant angular momentum is given by

$$H^2 = H_x^2 + H_y^2 + H_z^2 = (y p_z - z p_y)^2 + (z p_x - x p_z)^2 + (x p_y - y p_x)^2$$

$$= -\left(\frac{h}{2\pi}\right)^2\left[\left(y\frac{\partial}{\partial z} - z\frac{\partial}{\partial y}\right)^2 + \left(z\frac{\partial}{\partial x} - x\frac{\partial}{\partial z}\right)^2 + \left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right)^2\right],$$

so that

$$H^2 \psi_{nlm} = -\frac{h^2}{4\pi^2}\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right]\psi_{nlm} = \frac{h^2}{4\pi^2}\,\beta\,\psi_{nlm} = l(l+1)\,\frac{h^2}{4\pi^2}\,\psi_{nlm}. \qquad (93)$$

The expression for the eigenvalues of H is $\frac{h}{2\pi}[l(l+1)]^{1/2}$. The connection
between the magnetic moment M and the angular momentum H is given on
the basis of the correspondence principle by

$$M = \frac{e}{2m_0 c}\, H. \qquad (94)$$

Similarly if S is the angular momentum due to the spin, we have the magnetic
moment M given by

$$M = \frac{e}{m_0 c}\, S. \qquad (95)$$

A rigorous derivation of this correspondence is extremely difficult. The
hydrodynamical interpretation of $\psi$ provides a derivation of (94). The other
was derived by L. H. Thomas by considering the relativistic effect of the
electronic motion on the precession as observed from the nucleus. Dirac's
theory gives a rigorous explanation of both (94) and (95). It should also
be noted that it is not right to associate a genuine spin with the electron
in the mechanical sense. In (94) and (95) $m_0$ is the mass of the electron.
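
The eigenvalue relation (92) may be verified by applying the operator $(h/2\pi i)\,\partial/\partial\phi$ to $T_m$ numerically. The sketch below takes $h/2\pi$ as the unit of angular momentum and replaces the derivative by a central difference; both are conveniences assumed here, and the function names are hypothetical.

```python
import cmath
import math

def T(m, phi):
    """T_m(phi) = exp(i m phi) / sqrt(2 pi), as in (81)."""
    return cmath.exp(1j * m * phi) / math.sqrt(2.0 * math.pi)

def Hz_on_T(m, phi, eps=1e-5):
    """(h / 2 pi i) d/dphi applied to T_m, in units where h / 2 pi = 1."""
    derivative = (T(m, phi + eps) - T(m, phi - eps)) / (2.0 * eps)
    return derivative / 1j

phi = 0.7
for m in (-2, -1, 0, 1, 2, 3):
    ratio = Hz_on_T(m, phi) / T(m, phi)   # should equal m, the eigenvalue in units of h / 2 pi
    print("m =", m, " H_z T_m / T_m =", round(ratio.real, 6))
```

The ratio is independent of $\phi$ and equal to m, so that $T_m$, and with it $\psi_{nlm}$, is an eigenfunction of $H_z$.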

The associated Laguerre polynomials show that $R_{nl}$ has (n - l - 1) zeros.
The effect of it is that $\psi$ vanishes on (n - l - 1) spheres. The function $S_{l|m|}$ has
(l - |m|) zeros while $T_m$ has |m| only. Hence the total number of nodal surfaces
for an electron vibrating in the state $\psi_{nlm}$ is (n - 1). As $|\psi|^2$ is the probability
density for the position of the electron at any time we notice that this
is independent of time. The vibrations of the electron are stationary.
Hence there is no loss of energy due to radiation. We can interpret Bohr's
quantum conditions as the conditions for stationary de Broglie waves. The
nature of the function $|\psi|^2$ shows that there are n probability humps separated
by the (n - 1) zeros. It is customary to refer to the probability distribution
of the electron as the charge cloud as in Schrodinger's first theory of $\psi$.

The average value of any function f of the electronic co-ordinates is given
by

$$\bar{f} = \int_0^{\infty}\!\int_0^{\pi}\!\int_0^{2\pi} \psi_{nlm}^{*}\, f\, \psi_{nlm}\, r^2 \sin\theta\, d\theta\, d\phi\, dr, \qquad (96)$$

for the state $\psi_{nlm}$. The probability that the electron exists at a distance
between r and r + dr, irrespective of the direction, is

$$r^2\, dr \int_0^{2\pi}\!\int_0^{\pi} \psi_{nlm}^{*}\, \psi_{nlm}\, \sin\theta\, d\theta\, d\phi = R_{nl}^2\, r^2\, dr \int_0^{2\pi}\!\int_0^{\pi} |S_{l|m|}|^2\, |T_m|^2\, \sin\theta\, d\theta\, d\phi = R_{nl}^2\, r^2\, dr. \qquad (97)$$

We plot $R_{nl}^2 r^2$ against r and where the maximum is reached the analogue
of the radius of Bohr's orbit is obtained.
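
The maximum of $R_{nl}^2 r^2$ can be located by a simple scan. The sketch below assumes Z = 1, measures r in units of the radius of the first Bohr orbit, and uses the well-known explicit forms of $R_{10}$ and $R_{21}$ rather than evaluating (85); these choices and the function names are illustrative assumptions.

```python
import math

def R10(r):
    """Radial function for n = 1, l = 0 (Z = 1, r in Bohr radii)."""
    return 2.0 * math.exp(-r)

def R21(r):
    """Radial function for n = 2, l = 1 (Z = 1, r in Bohr radii)."""
    return r * math.exp(-r / 2.0) / (2.0 * math.sqrt(6.0))

def most_probable_radius(R, r_max=20.0, n=20000):
    """Value of r on a uniform grid at which R(r)^2 r^2 is greatest."""
    best_r, best_val = 0.0, -1.0
    for k in range(1, n + 1):
        r = k * r_max / n
        val = (R(r) * r) ** 2
        if val > best_val:
            best_r, best_val = r, val
    return best_r

print("1s: maximum of R^2 r^2 at r =", most_probable_radius(R10))
print("2p: maximum of R^2 r^2 at r =", most_probable_radius(R21))
```

For these two states the maxima fall at 1 and 4 Bohr radii, which are the radii of the first and second circular orbits of Bohr's theory.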

Given a number of charges in a state of motion about the origin the field
intensities and the flow of radiation can be determined as functions of the
dipole and quadrupole moments of the charges by the laws of classical
electrodynamics. For an atomic system with particles having charges $e_1$, $e_2$, ... $e_k$
etc. the x-component of the dipole moment is

$$P_x^{nn'} = \sum_k e_k \int \psi_n\, x_k\, \psi_{n'}^{*}\, d\tau,$$

where $\psi_n \propto e^{-2\pi i E_n t/h}$.

Hence the electric moment vector associated with the transition $n \rightarrow n'$ of an
electron is

$$P^{nn'} = R^{nn'}\, e^{-2\pi i (E_n - E_{n'})t/h} \qquad (98)$$

where

$$R_x^{nn'} = -e \int \psi_{nlm}\, x\, \psi_{n'l'm'}^{*}\, d\tau, \ \text{etc.}; \qquad (99)$$

the intensities of the radiation are obtained from the vector $R^{nn'}$ by using the
classical formulae. It can be proved that if $|l - l'| \neq 1$, $R^{nn'} = 0$. Hence
we obtain the selection rule,

$$\Delta l = \pm 1. \qquad (100)$$

The system of charges may have no dipole moment as, for instance, a
system consisting of two equal positive charges at two points and a negative
double charge at the midpoint; the quadrupole moment must be considered
in this case. We have now,

$$R_{jk}^{nn'} = -e \int \psi_{nlm}\, x_j x_k\, \psi_{n'l'm'}^{*}\, d\tau. \qquad (101)$$

The selection rule (100) no longer applies. Lines which are forbidden by the
dipole moment are allowed by the formula (101). Like this effect there is
another, of the second order, due to the magnetic dipole.

If there is a magnetic field along Oz let us take

$$x = \rho\cos\phi, \qquad y = \rho\sin\phi,$$

$$\psi_{nlm} = \chi\, e^{im\phi}, \qquad \psi_{n'l'm'} = \chi'\, e^{im'\phi},$$

$$d\tau = \rho\, d\phi\, dz\, d\rho,$$

and

$$R_z^{nn'} = -e \iiint \chi\, \chi'^{*}\, z\, e^{i(m - m')\phi}\, \rho\, d\rho\, dz\, d\phi. \qquad (102)$$

The integral vanishes if

$$m - m' \neq 0. \qquad (103)$$

Similarly calculating the other components,

$$\int \psi_{nlm}\, x\, \psi_{n'l'm'}^{*}\, d\tau, \qquad \int \psi_{nlm}\, y\, \psi_{n'l'm'}^{*}\, d\tau, \qquad (104)$$

we find that they vanish unless

$$|m - m'| = 1. \qquad (105)$$

The peculiarity of a dipole is that there is no emission of radiation in the
direction of vibration. Hence when m = m', as there is only the z-component
of the dipole moment, there is no emission parallel to the field direction, Oz.
The maximum flow of radiation is at right angles to Oz and it is polarized
parallel to Oz according to a classical result of the oscillator. The radiation
corresponding to $\Delta m = \pm 1$ is circularly polarized about Oz. An excellent
brief account of this question is given in a general manner by Condon and
Shortley. They have shown the exact relationship between the electric moment
and the electric intensity which gives the direction of polarization.

An interesting result regarding $S_{l|m|}(\theta)$ and $T_m(\phi)$ is given by Unsöld.
It is

$$\sum_{m=-l}^{l} \left[S_{l|m|}(\theta)\right]^2\, |T_m(\phi)|^2 = \text{const.} \qquad (106)$$


This theorem has important applications in the problems of many electrons. 
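
The normalization of the S-functions (82) and the theorem (106) can both be tested numerically. In the sketch below the associated Legendre functions are generated by the usual upward recurrence, the sign convention of which does not affect the squares; the function names are hypothetical and the value $(2l+1)/4\pi$ of the constant is the standard one, assumed here since the text leaves it unspecified.

```python
import math

def assoc_legendre(l, m, x):
    """P_l^m(x) for 0 <= m <= l, by the standard upward recurrence in l."""
    pmm = 1.0
    if m > 0:
        root = math.sqrt((1.0 - x) * (1.0 + x))
        fact = 1.0
        for _ in range(m):
            pmm *= -fact * root
            fact += 2.0
    if l == m:
        return pmm
    pmmp1 = x * (2 * m + 1) * pmm
    if l == m + 1:
        return pmmp1
    pll = 0.0
    for ll in range(m + 2, l + 1):
        pll = ((2 * ll - 1) * x * pmmp1 - (ll + m - 1) * pmm) / (ll - m)
        pmm, pmmp1 = pmmp1, pll
    return pll

def S(l, m, theta):
    """S_{l|m|}(theta) with the numerical factor of (82)."""
    m = abs(m)
    norm = math.sqrt((2 * l + 1) / 2.0 * math.factorial(l - m) / math.factorial(l + m))
    return norm * assoc_legendre(l, m, math.cos(theta))

def unsold_sum(l, theta):
    """Sum over m of S^2 |T_m|^2, using |T_m|^2 = 1 / (2 pi) for every m."""
    return sum(S(l, m, theta) ** 2 for m in range(-l, l + 1)) / (2.0 * math.pi)

def norm_integral(l, m, n=2000):
    """Midpoint-rule value of the integral of S^2 sin(theta) over 0..pi."""
    h = math.pi / n
    return h * sum(S(l, m, (k + 0.5) * h) ** 2 * math.sin((k + 0.5) * h) for k in range(n))

for l in range(4):
    print("l =", l, " sum at 0.3 and 1.2 rad:", unsold_sum(l, 0.3), unsold_sum(l, 1.2))
```

The sum is the same at every angle for a given l, so that a completed group of 2l + 1 states has a spherically symmetrical probability distribution.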


The effect of a nuclear disintegration on the atomic levels may be mentioned
in passing. Suppose for the sake of definiteness that an electron is
emitted by the nucleus at t = 0 changing the effective nuclear charge from
Z to Z + 1. Let the original wave functions be $\psi_1^{Z}$, $\psi_2^{Z}$, etc. and let the new
ones be similarly indicated by $\psi_1^{Z+1}$, $\psi_2^{Z+1}$, etc. The characteristic value of
the energy also will change. Consider an electron of one of the shells given by

$$\psi_1^{Z}\, e^{-2\pi i E_1 t/h}. \qquad (107)$$

As a result of the $\beta$ emission, the possible states for the bound electron are
$\psi_r^{Z+1}$ for r = 1, 2, 3 etc. Hence,

$$\psi_1^{Z}\, e^{-2\pi i E_1 t/h} = \sum_r C_r\, \psi_r^{Z+1}\, e^{-2\pi i E'_r t/h} \qquad (108)$$

at the time of emission, giving

$$\psi_1^{Z} = \sum_r C_r\, \psi_r^{Z+1}. \qquad (109)$$

Therefore,

$$C_r = \int \left(\psi_r^{Z+1}\right)^{*} \psi_1^{Z}\, d\tau. \qquad (110)$$

The probability that the new energy is $E'_r$ is $|C_r|^2$. This also means that
the probability that the outgoing electron takes away the energy $E_1 - E'_r$ from
the bound electron as it passes through the shell is $|C_r|^2$. Similar conclusions
follow regarding the energy changes of the other bound electrons.
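
The coefficient (110) can be evaluated in the simplest case. The sketch below assumes a single electron in the lowest hydrogen-like state, with Z changing from 1 to 2, and takes the radial function $2Z^{3/2}e^{-Zr}$ in units of the Bohr radius; this special case and the function names are illustrative assumptions and do not come from the text.

```python
import math

def R10(Z, r):
    """Lowest hydrogen-like radial function for nuclear charge Z (r in Bohr radii)."""
    return 2.0 * Z ** 1.5 * math.exp(-Z * r)

def overlap(Z_old, Z_new, r_max=40.0, n=40000):
    """C_1 of (110) by the midpoint rule; the angular factors integrate to unity."""
    h = r_max / n
    total = 0.0
    for k in range(n):
        r = (k + 0.5) * h
        total += R10(Z_new, r) * R10(Z_old, r) * r * r
    return total * h

def overlap_closed(Z_old, Z_new):
    """Closed form of the same integral: 8 (Z Z')^(3/2) / (Z + Z')^3."""
    return 8.0 * (Z_old * Z_new) ** 1.5 / (Z_old + Z_new) ** 3

C1 = overlap(1.0, 2.0)
print("C_1 =", C1, " closed form =", overlap_closed(1.0, 2.0))
print("probability of remaining in the lowest state =", C1 ** 2)
```

The remainder of the probability is shared by the higher states of the new atom, including those of the continuous spectrum.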
V MATRICES AND WAVES 


1. Heisenberg’s Matrix Theory. 2. The Harmonic Oscillator. 3. The Con- 
nection between Waves and Matrices. 4. Probability Postulates. 

1. HEISENBERG’S MATRIX THEORY 

From waves and corpuscles we have now to turn to matrices. The conspicuous
failures of Bohr's theory had shown the utter futility of all attempts
to visualize the atomic processes through pictorial representations. We have
seen how Schrodinger's theory assumed a new and abstract form although
it originated from de Broglie's attempt to reconcile two different representations
of matter, as of radiation. Heisenberg's approach to the quantum theory
was largely influenced by the difficulties that he and Kramers encountered in
a treatment of dispersion by Bohr's theory. The main difficulty was that the
conceptual element of the latter theory had grown out of proportion to its
job of correlating the empirical facts: and it inspired Heisenberg to initiate
a new line of attack. Its essence lay in the phenomenological outlook, the "from
the surface to the centre" philosophy. The theory started from physical
phenomena and ran on to the mathematical postulates.

For a conditionally periodic system the Fourier expansion of a coordinate
runs as follows:

$$q_k = \sum_{\tau_1=-\infty}^{\infty}\,\sum_{\tau_2=-\infty}^{\infty}\cdots\sum_{\tau_f=-\infty}^{\infty} Q^{(k)}_{\tau_1\tau_2\ldots\tau_f}\; e^{2\pi i(\tau_1\nu_1 + \tau_2\nu_2 + \ldots + \tau_f\nu_f)t}. \qquad (1)$$

The amplitudes $Q^{(k)}_{\tau_1\tau_2\ldots\tau_f}$ and the frequencies $\sum\tau\nu$ are functions of the
"continuously variable constants" of integration. It is by varying the constants
of integration that different dynamical states are obtained. The expansion
(1) is not directly useful for a description of the quantum phenomena.
Bohr had to supplement (1) by quantum rules. The phenomena are describable
by terms of the form,

$$q_k(n_r, m_r)\; e^{2\pi i\,\nu(n_r, m_r)\,t} \qquad (2)$$

where $n_r$ and $m_r$ are sets of integers each consisting of f. If the Bohr frequency
condition is to hold good, there must be two sets of integers for
distinguishing a transition between two states. Secondly, if the Rydberg-Ritz
combination principle is to hold good, the frequencies have to fulfil
the law,

$$\nu(n_r, k_r) + \nu(k_r, m_r) = \nu(n_r, m_r). \qquad (3)$$

Heisenberg's first problem was to establish correspondence between the terms
(2) and the coefficients of the expansion (1). On the basis of classical
electrodynamics it was assumed that terms of the form (2) describe fully the
character of radiation and its intensity. Hence for a full-fledged calculus of
observables such terms appeared sufficient. The correspondence principle for
large quantum numbers, as it had been enunciated by Bohr, was used by
Heisenberg to connect the terms (2) with those appearing in the classical
expansion (1). One thing that looked quite clear at the outset was that the
different terms connected with different sets of integers are independent of each
other. The ensemble of discrete terms could therefore be arranged as a matrix,

$$Q_k(n_r, m_r) = q_k(n_r, m_r)\; e^{2\pi i\,\nu(n_r, m_r)\,t} \qquad (4)$$

the different rows being given by the different values of $n_r$ and the columns
by those of the other set of numbers. The matrices are assumed to be Hermitian
so that

$$Q(n_r, m_r) = Q^{*}(m_r, n_r) \qquad (5)$$

or as

$$\nu(n_r, m_r) = -\nu(m_r, n_r), \qquad (6)$$

$$q_k(n_r, m_r) = q_k^{*}(m_r, n_r). \qquad (7)$$

The last assumption was suggested by the presence of conjugate imaginaries
in the expansion for $q_k$. Consider now how the correspondence is carried out
for large quantum numbers between the classical terms and the matrix elements.

Using the conditions

$$J_k = n_k h, \qquad \nu_k = \frac{\partial H}{\partial J_k} \qquad (8)$$

of the classical theory we get, when $n_k$ is large compared to $\alpha_k$, that is
$n_k - m_k$,

$$\nu_k = \frac{\partial H}{\partial J_k} \approx \frac{H(n_1, \ldots, n_k, \ldots, n_f) - H(n_1, \ldots, m_k, \ldots, n_f)}{\alpha_k h}$$

from which follows, for $n_r \gg (n_r - m_r)$,

$$\nu(n_r, m_r) = \frac{1}{h}\left[H(n_r) - H(m_r)\right] \approx \sum_r (n_r - m_r)\,\nu_r. \qquad (9)$$

In the Fourier expansions of $q_k$, in the stationary state $H(n_r)$, there arises
a term associated with the frequency $\sum_r (n_r - m_r)\nu_r$ and we identify that
with $q_k(n_r, m_r)$. It must be emphasized that the matrix terms for small
quantum numbers have no analogues in the classical theory.

For large values of n, m consider the (n, m) element of the matrix
$(2\pi i/h)(qp - pq)$:

$$\frac{2\pi i}{h}\sum_k \left[q(n, k)\,p(k, m) - p(n, k)\,q(k, m)\right] e^{2\pi i\nu(n, m)t}$$

$$= \frac{2\pi i}{h}\sum_{\alpha, \beta}\left[q(n, n-\alpha)\,p(n-\alpha, n-\alpha-\beta) - p(n, n-\beta)\,q(n-\beta, n-\alpha-\beta)\right] e^{2\pi i\nu(n, m)t},$$

where $\alpha + \beta = n - m$. Also

$$\nu(n, n-\alpha-\beta) = \sum_{r=1}^{f} (\alpha_r + \beta_r)\,\nu_r,$$

$$q(n, n-\alpha) - q(n-\beta, n-\beta-\alpha) = h\sum_r \beta_r\, \frac{\partial}{\partial J_r}\, q(\alpha, J)$$

and

$$e^{2\pi i t\sum\beta_r\nu_r}\; p(n-\alpha, n-\alpha-\beta) = e^{2\pi i t\sum\beta_r\nu_r}\; p(\beta, J).$$

Hence, $w_r = \nu_r t$ being the angle variable conjugate to $J_r$,

$$\frac{\partial}{\partial w_r}\left[p(\beta, J)\, e^{2\pi i t\sum\beta_r\nu_r}\right] = \beta_r \cdot 2\pi i\; p(\beta, J)\, e^{2\pi i t\sum\beta_r\nu_r}.$$

Using these results we get the (n, m) element of the matrix $\frac{2\pi i}{h}(qp - pq)$,

$$\sum_\alpha\sum_\beta \Big\{\left[q(n, n-\alpha) - q(n-\beta, n-\beta-\alpha)\right] p(n-\alpha, n-\alpha-\beta) - \left[p(n, n-\beta) - p(n-\alpha, n-\beta-\alpha)\right] q(n-\beta, n-\beta-\alpha)\Big\}\; \frac{2\pi i}{h}\; e^{2\pi i\nu(n, n-\alpha-\beta)t}$$

$$= \sum_r \left(\frac{\partial q}{\partial J_r}\frac{\partial p}{\partial w_r} - \frac{\partial p}{\partial J_r}\frac{\partial q}{\partial w_r}\right) = -[q, p]. \qquad (10)$$

The (n, m) element of the matrix turns out to be the (n - m) harmonic of
$-[q, p]$ expressed in terms of the action-angle variables. In view of this
correspondence a postulate of Heisenberg's theory is stated as

$$(qp - pq) = \frac{ih}{2\pi}\,[q, p] \qquad (11)$$

where q and p are canonical conjugates of classical mechanics. From (11)
and from the classical relations satisfied by the Poisson brackets we find
Heisenberg's matrices subject to the following quantum conditions:

$$q_l p_l - p_l q_l = \frac{ih}{2\pi}\, 1, \qquad (12a)$$

$$q_l p_k - p_k q_l = 0, \qquad (12b)$$

$$p_k p_l - p_l p_k = 0, \qquad (12c)$$

$$q_k q_l - q_l q_k = 0, \qquad (12d)$$

where k and l are distinct suffixes and by 1 in (12a) is meant a unit matrix
of the same dimensions as the matrix on the left-hand side.
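
The condition (12a) can be exhibited on a concrete pair of matrices. The sketch below borrows the well-known matrices of the linear harmonic oscillator, in units where $h/2\pi$, the mass and the frequency are all unity, and cuts them off at N rows; the choice of system, the cut-off and the helper names are assumptions made for illustration.

```python
import math

N = 6  # size of the truncated matrices (assumed cut-off)

def zeros(n):
    return [[0j] * n for _ in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def sub(a, b):
    n = len(a)
    return [[a[i][j] - b[i][j] for j in range(n)] for i in range(n)]

# Oscillator matrices: only the elements next to the diagonal are different from zero.
q = zeros(N)
p = zeros(N)
for n in range(N - 1):
    amp = math.sqrt((n + 1) / 2.0)
    q[n][n + 1] = q[n + 1][n] = amp
    p[n][n + 1] = -1j * amp
    p[n + 1][n] = 1j * amp

comm = sub(matmul(q, p), matmul(p, q))   # qp - pq, to be compared with i (h / 2 pi) 1

for k in range(N):
    print("diagonal element", k, "of qp - pq:", comm[k][k])
```

Every diagonal element equals i except the last, which is spoiled by the cut-off; the exact relation (12a) requires matrices of infinite order.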

The Hamiltonian equations of the classical theory give

$$\dot{q} = [q, H], \qquad \dot{p} = [p, H] \qquad (13)$$

and hence it is postulated that,

$$\frac{ih}{2\pi}\,\dot{q} = (qH - Hq), \qquad \frac{ih}{2\pi}\,\dot{p} = (pH - Hp). \qquad (14)$$

The equations (12) hold good at one instant. But it can be shown from
(14) that they are valid for all values of t. Consider, for example, (12a):

$$\frac{d}{dt}(qp - pq) = \dot{q}p - p\dot{q} + q\dot{p} - \dot{p}q$$

$$= \frac{2\pi}{ih}\left[(qH - Hq)p - p(qH - Hq) + q(pH - Hp) - (pH - Hp)q\right]$$

$$= \frac{2\pi}{ih}\left[H(pq - qp) + (qp - pq)H\right] \qquad (15)$$

$$= -H + H = 0, \qquad (16)$$

when use is made of (12a) in (15). Hence the quantum conditions (12)
remain invariant for all time. So the second postulate contained in (14) does
not conflict with the first as given by (12).

The p-q relations are further supplemented by Dirac by the relation

$$-tE + Et = \frac{ih}{2\pi}. \qquad (17)$$

There are two reasons for treating t on the same footing as q. The first is
the principle of relativity which dispenses with the concept of absolute time.
Hence the relative time, as it affects the measurements, must be bound up with
a canonical conjugate as q and p are. Secondly, in the description of a particle
the space-time and the energy-momentum are so correlated that the energy and
time must mutually obey the same relation that the momentum and position
do. This is made particularly obvious by the wave function,

$$\psi = e^{2\pi i(\tau x - \nu t)}$$

taken with the duality relations,

$$E = h\nu, \qquad p = h\tau.$$

We have already shown that the uncertainty relation,

$$\Delta p\, \Delta x \sim h,$$

follows from the resolution of a function of x into wave terms. Similarly
the Fourier expansion of a time function gives

$$\Delta E\, \Delta t \sim h.$$

It stands to reason, therefore, that a quantum condition such as (17) should
be valid. Following Dirac's abstract formulation it will be shown in the
next chapter that the non-commutability relations lead to the uncertainty
conditions. At this stage it is enough to point out that the Hamiltonian
equations follow from the Pfaffian,

$$\sum_r p_r\, dq_r - H\, dt. \qquad (18)$$

If H = E, t and E can be taken as conjugate variables. H - E plays the role
of the Hamiltonian and the equations of motion are

$$\dot{x} = [x, H - E] \qquad (19)$$

where by [x, y] is meant the generalized Poisson bracket,

$$[x, y] = \sum_r \frac{\partial(x, y)}{\partial(q_r, p_r)} - \frac{\partial x}{\partial t}\frac{\partial y}{\partial E} + \frac{\partial x}{\partial E}\frac{\partial y}{\partial t}. \qquad (20)$$

This gives the additional quantum relations

$$tq - qt = 0, \qquad tp - pt = 0,$$

$$Eq - qE = 0, \qquad Ep - pE = 0, \qquad (21)$$

$$-tE + Et = \frac{ih}{2\pi}.$$

The equations (13), (14), (19), and (21) are all mutually compatible.

The diagonal matrices play a very important part in this as well as in the
more developed theory of Dirac. In Heisenberg's theory the frequency term
does not appear in any of the diagonal terms as is quite clear from (4).
Hence for a diagonal matrix, M, the time-derivative must vanish. We know
that in a Fourier series the constant term gives the time-average of the quantity
represented by it. Similarly the diagonal terms of a general matrix give the
time-averages for the various stationary states. If a matrix M is such that

$$\dot{M}(n, m) = 2\pi i\,\nu(n, m)\, M(n, m) = 0 \qquad (22)$$

for every pair of sets n and m, and if the system is not degenerate, then for
$n \neq m$

$$\nu(n, m) \neq 0, \quad \text{and} \quad M(n, m) = 0; \qquad (23)$$

and the matrix is diagonal. In case the system is degenerate so that $E_n = E_m$,
it may happen that

$$\nu(n, m) = 0 \quad \text{and} \quad M(n, m) \neq 0. \qquad (24)$$
One of the major difficulties which checked the progress of the matrix
theory concerned the representation of derivatives. The fundamental relations
between p, q, E, t give the time-derivatives of only the canonical variables.
The theory must also give us matrix representations of perfectly general
functions of the variables and their derivatives. The difficulty was removed by
the non-commutability relations. Consider,

$$qp - pq = \frac{ih}{2\pi}.$$

Hence

$$qp^2 - pqp = \frac{ih}{2\pi}\, p \quad \text{and} \quad pqp - p^2 q = \frac{ih}{2\pi}\, p.$$

Therefore,

$$qp^2 - p^2 q = \frac{ih}{2\pi}\, 2p.$$

Similarly, it can be proved by successive multiplications and additions that

$$qp^n - p^n q = \frac{ih}{2\pi}\, n p^{n-1}. \qquad (25)$$

Thus if f(p) is any function which can be expanded in a power series of p,

$$q f(p) - f(p)\, q = \frac{ih}{2\pi}\, f'(p). \qquad (26)$$

In the same manner it can be shown that

$$p f(q) - f(q)\, p = -\frac{ih}{2\pi}\, f'(q). \qquad (27)$$

The results (26) and (27) can be proved to be true for more general functions
of p and q.
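
The relation (27) may be checked by letting both sides act on a polynomial in q. The sketch below represents p by $(h/2\pi i)\,d/dq$, which satisfies $qp - pq = ih/2\pi$, takes $h/2\pi$ as unity, and stores polynomials as lists of coefficients in ascending powers; the helper names and the particular polynomials are hypothetical.

```python
def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (ascending powers)."""
    out = [0j] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def poly_diff(a):
    """Derivative of a polynomial."""
    return [k * a[k] for k in range(1, len(a))] or [0j]

def apply_p(g):
    """p g = (h / 2 pi i) dg/dq, in units where h / 2 pi = 1."""
    return [-1j * c for c in poly_diff(g)]

def pad(a, n):
    return list(a) + [0j] * (n - len(a))

def commutator_on(f, g):
    """(p f(q) - f(q) p) acting on g(q)."""
    left = apply_p(poly_mul(f, g))
    right = poly_mul(f, apply_p(g))
    n = max(len(left), len(right))
    return [x - y for x, y in zip(pad(left, n), pad(right, n))]

f = [0, 0, 0, 1]          # f(q) = q^3
g = [1, 2, 0, 0.5]        # an arbitrary test polynomial
lhs = commutator_on(f, g)
rhs = [-1j * c for c in poly_mul(poly_diff(f), g)]   # -(i h / 2 pi) f'(q) g
n = max(len(lhs), len(rhs))
lhs, rhs = pad(lhs, n), pad(rhs, n)
print("largest difference between the two sides:", max(abs(x - y) for x, y in zip(lhs, rhs)))
```

The two sides agree coefficient by coefficient, and the same check succeeds for any polynomial f and g.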

Dirac has generalized Heisenberg’s postulates by stating for any two
quantum magnitudes $a$ and $b$ the relation,

$$ \frac{ih}{2\pi}\,[a, b] = ab - ba. \tag{28} $$

In classical mechanics if $\xi$ is any function of the $p$’s and $q$’s,

$$ \dot{\xi} = [\xi, H], \tag{29} $$

so that, by (28), the corresponding matrix relation is

$$ \frac{ih}{2\pi}\,\dot{\xi} = \xi H - H\xi, \tag{30} $$

and it follows immediately that if $H$ is a function like $\xi$, it must be constant
and therefore representable by a diagonal matrix when there is no degeneracy.
It can be shown alternatively that if $H$ is a diagonal matrix the relation (30)
holds good.

Returning to Heisenberg’s method, the solution of a problem requires that
Hermitian matrices $p_k$, $q_k$ be determined satisfying the exchange relations
(12) and making $H$ a diagonal matrix. If the factors

$$ e^{2\pi i t (H_n - H_m)/h} $$

are associated with the matrices $p_k$, $q_k$ the equations (14) are also satisfied.

If $p_k$ and $q_k$ do not make $H$ diagonal one subjects the matrices to what
Dirac calls a contact transformation. If $S$ is any matrix, the new variables
$p'_k$, $q'_k$ given by

$$ p'_k = S^{-1} p_k S, \qquad q'_k = S^{-1} q_k S \tag{31} $$

also satisfy the exchange relations. The property of invariance of a Poisson
bracket under a contact transformation suggests by analogy the same name
here for the transformation (31). The effect on the Hamiltonian is shown by

$$ H' = S^{-1} H S. \tag{32} $$

When $S$ is so chosen that $H'$ is a diagonal matrix the problem is solved. One
limitation on the choice of $S$ is imposed by the condition that $p'_k$, $q'_k$ as
defined above, must be Hermitian. That requires $S$ to be unitary. Hence

$$ \sum_k S(n,k)\,S^{*}(m,k) = 0 \quad (n \neq m), \qquad \sum_k S(n,k)\,S^{*}(n,k) = 1. \tag{33} $$

The equation (32) is the same as

$$ HS = SH'. $$

The matrix terms $(n, m)$ of the two sides follow the equality:

$$ \sum_k H_{nk} S_{km} = \sum_k S_{nk} H'_{km} = S_{nm} H'_{mm} \tag{34} $$

as $H'$ is a diagonal matrix. Corresponding to any particular value of $m$, by
varying $n$, we get a set of homogeneous linear equations which have a
non-trivial solution if the determinant vanishes, that is, if

$$ \left| H_{rs} - \delta_{rs} H'_{mm} \right| = 0, \tag{35} $$

where $\delta_{rs}$ stands for the $(r, s)$ term of the unit matrix. All the diagonal
components of $H'$ satisfy that equation. For each value of $m$ we get one
column of the matrix $S$ from (34). In the case of a degenerate system the
roots $H'_{mm}$ of the determinantal equation are not all unequal.
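
The procedure of (32) to (35) can be tried out numerically on a small matrix. The following is only a sketch under stated assumptions: the 4 x 4 Hermitian matrix standing for H is random and purely illustrative, and the helper name `diagonalize` is ours; the eigenvectors returned by NumPy's `eigh` play the part of the columns of S.

```python
import numpy as np

def diagonalize(H):
    """Return (S, H_prime) with S unitary and H_prime = S^{-1} H S diagonal."""
    eigenvalues, S = np.linalg.eigh(H)   # columns of S are orthonormal eigenvectors
    H_prime = S.conj().T @ H @ S         # S^{-1} equals the Hermitian conjugate of a unitary S
    return S, H_prime

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
H = (A + A.conj().T) / 2                 # an arbitrary Hermitian matrix (illustrative only)

S, H_prime = diagonalize(H)
off_diagonal = np.abs(H_prime - np.diag(np.diag(H_prime))).max()
unitarity_error = np.abs(S @ S.conj().T - np.eye(4)).max()   # conditions (33)
column_error = np.abs(H @ S - S @ H_prime).max()             # H S = S H'
print(off_diagonal, unitarity_error, column_error)
```

All three printed numbers should be at the level of rounding error, which is the numerical counterpart of the statement that a unitary S making H' diagonal solves the problem.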

The perturbation theory for matrices is extremely simple in form. Let
the Hamiltonian consist of the two parts

$$ H = H_0 + \Delta H \tag{36} $$

of which the second part can be expanded in powers of a small parameter $\lambda$:

$$ \Delta H = \lambda H_1 + \lambda^2 H_2 + \dots \tag{37} $$

Suppose that the problem is solved for $H_0$, which means that the coordinates
and momenta make $H_0$ diagonal. For the general problem we have to find
a unitary transformation function $S$ so that

$$ HS = SH', \tag{38} $$

$H'$ being a diagonal matrix. We will assume the following expansion for
$S$ and $H'$:

$$ S = S_0 + \lambda S_1 + \lambda^2 S_2 + \dots \tag{39} $$

$$ H' = H_0 + \lambda W_1 + \lambda^2 W_2 + \dots \tag{40} $$

Substituting in (38), the different powers of $\lambda$ give

$$ S_0 H_0 = H_0 S_0 \tag{41} $$

$$ H_0 S_1 + H_1 S_0 - S_1 H_0 = S_0 W_1 \tag{42} $$

etc. It is enough to discuss here the two equations (41) and (42). The
first of the two is equivalent to

$$ \sum_k \{ S_0(n,k)\,H_0(k,m) - H_0(n,k)\,S_0(k,m) \} = 0, $$

or

$$ S_0(n,m)\,H_0(m,m) - H_0(n,n)\,S_0(n,m) = 0. $$

Hence if there is no degeneracy, that is, if $H_0(m,m) \neq H_0(n,n)$,
we get

$$ S_0(n,m) = 0, \qquad n \neq m, $$

and we can take $S_0$ as the unit matrix. The equation (42) now becomes

$$ H_0 S_1 - S_1 H_0 + H_1 = W_1; $$

it gives the following relation between the matrix elements:

$$ H_0(n,n)\,S_1(n,m) - S_1(n,m)\,H_0(m,m) + H_1(n,m) = W_1(n,m), $$

or, using Bohr’s frequency condition,

$$ h\nu(n,m)\,S_1(n,m) = W_1(n,n)\,\delta_{nm} - H_1(n,m). \tag{43} $$

It is assumed that every matrix in the expansion of $H'$ is diagonal. From (43)
we get

$$ H_1(n,n) = W_1(n,n). \tag{44} $$

When $n \neq m$ the non-diagonal elements of $S_1$ are given by (43). But the
equation fails to give the diagonal elements of $S_1$. Since $S$ is to satisfy the
conditions (33) we can take the diagonal elements of $S_1$ to be zero to the
first order of approximation. The non-diagonal elements are given by

$$ S_1(n,m) = -\frac{H_1(n,m)}{h\nu(n,m)}. \tag{45} $$

The relation (44) is the analogue of the classical one, viz., that the time-average
of the perturbation function is the perturbation energy. In Heisenberg’s
theory, the diagonal elements are time-averages as we have pointed out
earlier.
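
The first-order results (44) and (45) lend themselves to a direct numerical check. The sketch below assumes a small non-degenerate example: the unperturbed energies, the symmetric matrix standing for H_1 and the value of the parameter are all hypothetical numbers chosen for illustration, and h nu(n, m) is written as the energy difference E_n - E_m.

```python
import numpy as np

E0 = np.array([0.0, 1.0, 2.5, 4.0])      # non-degenerate H_0(n, n); hypothetical values
H0 = np.diag(E0)
H1 = np.array([[0.3, 0.1, 0.0, 0.2],
               [0.1, -0.2, 0.4, 0.0],
               [0.0, 0.4, 0.1, 0.3],
               [0.2, 0.0, 0.3, -0.1]])   # real symmetric perturbation; hypothetical
lam = 1e-3

# (44): the first-order energies are the diagonal elements of H_1
W1 = np.diag(H1)

# (45): S_1(n, m) = -H_1(n, m) / (E_n - E_m) for n != m; diagonal of S_1 taken as zero
gap = E0[:, None] - E0[None, :]
mask = ~np.eye(len(E0), dtype=bool)
S1 = np.zeros_like(H1)
S1[mask] = -H1[mask] / gap[mask]

exact = np.linalg.eigvalsh(H0 + lam * H1)
first_order = E0 + lam * W1
energy_error = np.abs(exact - first_order).max()      # expected to be of order lam**2

S = np.eye(len(E0)) + lam * S1
H_prime = np.linalg.inv(S) @ (H0 + lam * H1) @ S
residual = np.abs(H_prime - np.diag(np.diag(H_prime))).max()   # also of order lam**2
print(energy_error, residual)
```

Both printed numbers should be small compared with the parameter itself, since the neglected terms begin at the second order.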

Suppose however that the undisturbed state is degenerate. Let

$$ H_0(n_1, n_1) = H_0(n_2, n_2) = H_0(n_3, n_3). \tag{46} $$

We cannot therefore conclude that $S_0(n, m) = 0$ when $n$, $m$ are any two of
the integers $n_1$, $n_2$, $n_3$. We can however assume, if there is no other source
of degeneracy, that barring these three rows and columns $S_0$ is a diagonal
matrix. Moreover the condition for $S_0$ to be unitary makes all the elements
of the diagonalized parts of the matrix equal to unity. The equation (42)
provides the necessary information regarding the non-vanishing elements of
$S_0$ and the average values of $W_1$. If $n \neq n_1, n_2, n_3$ the equation (44) still
holds good. For these three states we have the following homogeneous equations:

$$ S_0(n_1, n_1)\,[H_1(n_1, n_1) - W_1(n_1, n_1)] + S_0(n_1, n_2)\,H_1(n_2, n_1) + S_0(n_1, n_3)\,H_1(n_3, n_1) = 0 $$

$$ S_0(n_1, n_1)\,H_1(n_1, n_2) + S_0(n_1, n_2)\,[H_1(n_2, n_2) - W_1(n_1, n_1)] + S_0(n_1, n_3)\,H_1(n_3, n_2) = 0 $$

$$ S_0(n_1, n_1)\,H_1(n_1, n_3) + S_0(n_1, n_2)\,H_1(n_2, n_3) + S_0(n_1, n_3)\,[H_1(n_3, n_3) - W_1(n_1, n_1)] = 0. $$

Eliminating $S_0(n_1, n_1)$, $S_0(n_1, n_2)$ and $S_0(n_1, n_3)$ we get a determinantal
equation for the value of $W_1(n_1, n_1)$. The same equation gives $W_1(n_2, n_2)$
and $W_1(n_3, n_3)$. Hence the three missing terms of $W_1$ come as the roots
of a cubic.
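
The statement that the three missing terms of W_1 are the roots of a cubic may be illustrated as follows. This is a sketch only: the 3 x 3 block of H_1 between the degenerate states is a hypothetical symmetric matrix, and the cubic is formed with NumPy's `poly` as the characteristic polynomial of that block.

```python
import numpy as np

# Hypothetical block of H_1 between the degenerate states n_1, n_2, n_3
H1_block = np.array([[0.2, 0.5, 0.0],
                     [0.5, -0.1, 0.3],
                     [0.0, 0.3, 0.4]])

# Coefficients of the cubic det(W * I - H1_block) = 0
cubic = np.poly(H1_block)
roots = np.sort(np.roots(cubic).real)

# The same three numbers obtained by diagonalizing the block directly;
# the columns of S0_block give the non-vanishing elements of S_0
W1, S0_block = np.linalg.eigh(H1_block)
print(roots, W1)
```

The two printed arrays should agree, the eigenvector matrix supplying the elements of S_0 that the non-degenerate argument leaves undetermined.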

2. THE HARMONIC OSCILLATOR 

We will take up in this section the simple problem of the harmonic oscillator
and show how it has been worked out in the matrix theory. The solution of
this problem in accordance with Schrödinger’s theory has been already
introduced earlier. The matrix components can also be evaluated from
Schrödinger’s $\psi$-functions as was first pointed out by him. The general connections
between the wave functions and the matrices will be shown in the next section.

For an oscillator,

$$ H = \frac{p^2}{2m_0} + \frac{\mu}{2}\,q^2 \tag{48} $$

and

$$ \ddot{q} + (2\pi\nu_0)^2 q = 0, \tag{49} $$

where

$$ (2\pi\nu_0)^2 = \mu/m_0. \tag{50} $$

Assuming the matrix

$$ q(n,m)\,e^{2\pi i \nu(n,m) t} $$

for $q$ we get on substitution in (49), when it is treated as a matrix equation,*

$$ \{ \nu^2(n,m) - \nu_0^2 \}\, q(nm) = 0. \tag{51} $$

Hence

$$ q(nm) = 0, \quad\text{unless}\quad \nu(n,m) = \pm\nu_0. \tag{51'} $$

The exchange relation

$$ qp - pq = \frac{ih}{2\pi} \tag{52} $$

becomes for the diagonal terms

$$ \frac{ih}{2\pi} = \sum_k \{ q(nk)\, m_0\, 2\pi i\,\nu(k,n)\, q(kn) - m_0\, 2\pi i\,\nu(n,k)\, q(nk)\, q(kn) \}. \tag{53} $$

Since

$$ \nu(n,k) = -\nu(k,n) \quad\text{and}\quad q(kn) = q^{*}(nk), \tag{54} $$

$$ \sum_k \nu(n,k)\, |q(kn)|^2 = -\frac{h}{8\pi^2 m_0}. \tag{55} $$

We can now evaluate the energy of the oscillator in a stationary state from
(48).

$$ H(nn) = 2\pi^2 m_0 \sum_k \{ \nu_0^2 + \nu^2(n,k) \}\, |q(nk)|^2. \tag{56} $$

The three important equations on which the subsequent discussion hangs are
(51), (55) and (56). From (51'), $q(nm) = 0$ unless

$$ H(nn) - H(mm) = \pm h\nu_0. \tag{57} $$

If (55) is to hold good it is necessary that all $q(nm)$ should not be zero.
Hence for any $n$ there exists an $m$ such that (57) is satisfied. In the absence
of degeneracy there are only two values of $m$ satisfying the last condition.
Let us take

$$ H(nn) - H(n-1\ n-1) = h\nu_0, $$

$$ H(nn) - H(n+1\ n+1) = -h\nu_0. $$

There is no loss of generality in this as the order in which terms are arranged
in the matrix form has not been specified. We have now

$$ \nu(n, n-1) = \nu_0, \qquad \nu(n, n+1) = -\nu_0, $$

$$ q(n\ n-1) \neq 0, \qquad q(n\ n+1) \neq 0, $$

all other values of $\nu$ and $q$-components vanishing.

* Henceforth we will find it convenient to denote the matrix elements $q(n, m)$
etc. as $q(nm)$ etc.

Hence, from (56)

$$ H(nn) = 4\pi^2 m_0 \nu_0^2 \{ |q(n\ n-1)|^2 + |q(n\ n+1)|^2 \}. \tag{58} $$

The equation (55) now reduces to

$$ -\frac{h}{8\pi^2 m_0} = \nu_0 \{ |q(n\ n-1)|^2 - |q(n\ n+1)|^2 \}. \tag{59} $$

From (58) and (59) it can be shown by considering a series of difference
equations, that the equations

$$ H(n_k n_k) = h\nu_0 \left( k + \tfrac{1}{2} \right), \tag{60} $$

$$ |q(n_k\ n_{k+1})|^2 - |q(n_k\ n_{k-1})|^2 = \frac{h}{8\pi^2 m_0 \nu_0}, \tag{61} $$

$$ |q(n_k\ n_{k+1})|^2 = \frac{h(k+1)}{8\pi^2 m_0 \nu_0} \tag{62} $$

hold good. The ordering of the suffixes is done so that $n_0$ corresponds to the
ground state; $n_1, n_2, \dots, n_k$ are the successive higher states of energy. The
$q$-matrix is of the following form if we take $n_0, n_1, n_2, \dots$ in the natural
order of integers.

$$ \begin{pmatrix}
0 & q(01)\,e^{2\pi i \nu(0,1) t} & 0 & 0 & \cdots \\
q(10)\,e^{2\pi i \nu(1,0) t} & 0 & q(12)\,e^{2\pi i \nu(1,2) t} & 0 & \cdots \\
0 & q(21)\,e^{2\pi i \nu(2,1) t} & 0 & q(23)\,e^{2\pi i \nu(2,3) t} & \cdots \\
\cdots & \cdots & \cdots & \cdots & \cdots
\end{pmatrix} \tag{63} $$

The final results may now be expressed in the familiar form:

$$ H(nn) = h\nu_0 \left( n + \tfrac{1}{2} \right), \tag{60'} $$

$$ |q(n\ n+1)|^2 = \frac{h(n+1)}{8\pi^2 m_0 \nu_0}, \tag{62'} $$

$$ \nu(n+1, n) = \nu_0. \tag{64} $$
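
The final formulae can be verified by building truncated matrices and computing the energy and the exchange relation directly. The following is a sketch under stated assumptions: the values of h, m_0 and nu_0 are set to unity for convenience, the phases of q(n n+1) are taken real, and the matrices are cut off at N rows, so that the last diagonal element is spoiled by the truncation and is left out of the comparison.

```python
import numpy as np

h, m0, nu0 = 1.0, 1.0, 1.0     # hypothetical units chosen for convenience
N = 8                          # truncation size (the true matrices are infinite)

n = np.arange(N - 1)
amp = np.sqrt(h * (n + 1) / (8 * np.pi**2 * m0 * nu0))   # |q(n n+1)| from (62')

q = np.zeros((N, N), dtype=complex)
q[n, n + 1] = amp
q[n + 1, n] = amp              # q(n+1 n) = q*(n n+1), phases taken real

# p = m0 * dq/dt, i.e. p(nm) = 2*pi*i*m0*nu(n, m)*q(nm) with nu(n, m) = (n - m)*nu0
idx = np.arange(N)
nu = (idx[:, None] - idx[None, :]) * nu0
p = 2j * np.pi * m0 * nu * q

# Hamiltonian (48) with mu = m0 * (2*pi*nu0)**2, as in (50)
H = p @ p / (2 * m0) + 0.5 * m0 * (2 * np.pi * nu0)**2 * (q @ q)
commutator = q @ p - p @ q

energies = np.diag(H).real[:N - 1]        # last element dropped: truncation artefact
exchange = np.diag(commutator)[:N - 1]
print(energies / (h * nu0))
```

Within the retained rows the energies should reproduce (60') and the diagonal of qp - pq should equal ih/2π, as required by (52).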

The theory of perturbation is illustrated by the anharmonic oscillator for
which we take

$$ H = H_0 + \lambda q^3 $$

where

$$ H_0 = \frac{p^2}{2m_0} + \frac{\mu}{2}\,q^2. $$

The matrix of $H_1$ is given by

$$ H_1(n,m) = \sum_k \sum_l q(nk)\, q(kl)\, q(lm). $$

To carry out the summation for $k$ and $l$ we have to note that for any $n$,
the non-zero members of the $q$-matrix are only $q(n-1\ n)$ and $q(n\ n+1)$.
Hence we need take into consideration merely the following sets of values for
$k$, $l$, $m$.

$k$: $(n-1)$, $(n+1)$;

$l$: $(n-2)$, $n$, $(n+2)$;

$m$: $(n-3)$, $(n-1)$, $(n+1)$, $(n+3)$.

It is obvious that the non-zero terms of $H_1$ are of the form

$$ H_1(n\ n-3), \quad H_1(n\ n-1), \quad H_1(n\ n+1), \quad H_1(n\ n+3). $$

The diagonal terms of $H_1$ are all zero and, therefore, the first order perturbation
energy is zero. The calculations have to be pushed to the second order
of approximation for getting the energy arising out of $H_1$.
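
The selection of non-zero terms can be confirmed by cubing a truncated q-matrix. In this sketch the constant factor in q(n n+1) is dropped, since only the pattern of zeros matters, and the size N is an arbitrary choice.

```python
import numpy as np

N = 10
n = np.arange(N - 1)
q = np.zeros((N, N))
q[n, n + 1] = q[n + 1, n] = np.sqrt(n + 1.0)    # q(n n+1) up to a constant factor, cf. (62')

H1 = q @ q @ q                                   # H_1(nm) = sum_k sum_l q(nk) q(kl) q(lm)

rows, cols = np.nonzero(np.abs(H1) > 1e-12)
offsets = sorted(set((cols - rows).tolist()))    # values of m - n for which H_1(nm) != 0
first_order_energy = np.diag(H1)
print(offsets, first_order_energy)
```

Only the offsets m - n = ±1 and ±3 should survive, and the diagonal should vanish identically, in agreement with the vanishing first order energy.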


3. THE CONNECTION BETWEEN WAVES AND MATRICES


Schrodinger and Eckart have independently shown that the same physical 
facts are represented by Heisenberg’s matrices and the Schrodinger wave func- 
tions. The equivalence follows from the Fourier expansion theorem by means 
of a set of normalized orthogonal functions and the curious representation of 
a momentum co-ordinate by a differential operator. 

Let $u$ be any classical function. Then we can associate a matrix with
it through a set of normalized orthogonal functions $\psi_k$. For consider the
function $u\psi_n$. Let us write the expansion of this function as

$$ u\,\psi_n = \sum_k u(kn)\,\psi_k, \qquad u\,\psi_n^{*} = \sum_k u(nk)\,\psi_k^{*}. \tag{65} $$

The values of the co-efficients $u(kn)$ satisfy the condition,

$$ u(kn) = \int \psi_k^{*}\, u\, \psi_n \, d\tau. \tag{66} $$

Similar matrix elements can be associated with any other function $v$. These
arrays of functions satisfy the matrix rule of multiplication for

$$ uv(n,m) = \int \psi_n^{*}\, u\, v\, \psi_m \, d\tau $$

$$ = \int \sum_k u(nk)\,\psi_k^{*}\; v\,\psi_m \, d\tau $$

$$ = \sum_k \sum_j u(nk)\, v(jm) \int \psi_k^{*}\,\psi_j \, d\tau $$

$$ = \sum_k u(nk)\, v(km). \tag{67} $$
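
The multiplication rule (67) rests on the completeness of the orthogonal set, and this can be seen in a finite model. The sketch below is an assumption-laden stand-in for the continuous case: the integral over space is replaced by a sum over M grid points, the orthonormal functions are the columns of a random unitary matrix (obtained from a QR factorization), and the helper name `matrix_of` is ours.

```python
import numpy as np

rng = np.random.default_rng(1)
M = 6                                           # grid points = number of basis functions
x = np.linspace(-1.0, 1.0, M)

# Columns of psi: a complete orthonormal set on the grid (hypothetical, from QR)
psi, _ = np.linalg.qr(rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M)))

def matrix_of(values):
    """Matrix u(kn) = sum over x of psi_k^*(x) u(x) psi_n(x), a discrete form of (66)."""
    return psi.conj().T @ (values[:, None] * psi)

u = x**2
v = np.cos(x)
product_rule_error = np.abs(matrix_of(u * v) - matrix_of(u) @ matrix_of(v)).max()
print(product_rule_error)
```

Because the set is complete on the grid, the matrix of the product uv agrees with the product of the matrices to rounding error; with an incomplete set the rule would hold only approximately.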

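That Heisenberg’s matrices follow from the solutions of Schrödinger’s
equation has been shown by considering the integral,

$$ f(n,m) = \int \psi_n^{*}\, f\!\left( q,\ \frac{h}{2\pi i}\frac{\partial}{\partial q} \right) \psi_m \, d\tau. \tag{68} $$

This integral is interpreted as the $(n, m)$ element of the matrix for $f(q, p)$.
$\psi_n^{*}$ and $\psi_m$ are now solutions of Schrödinger’s equation of the system. The
volume integration is carried over all space. The Heisenberg matrices satisfy
the condition,

$$ pq - qp = -\frac{ih}{2\pi}. $$

Using (68) we have

$$ (pq - qp)(n,m) = \int \psi_n^{*} \left\{ \frac{h}{2\pi i}\frac{\partial}{\partial q}(q\,\psi_m) - q\,\frac{h}{2\pi i}\frac{\partial \psi_m}{\partial q} \right\} d\tau = \frac{h}{2\pi i} \int \psi_n^{*}\,\psi_m \, d\tau = -\frac{ih}{2\pi}\,\delta_{nm}. \tag{69} $$

Hence the matrices given by the rule obey the exchange relations. We can
show that they satisfy the matrix equations of motion, viz.,

$$ -\frac{ih}{2\pi}\,\dot{q} = Hq - qH, \qquad -\frac{ih}{2\pi}\,\dot{p} = Hp - pH. \tag{70} $$

For, consider the function $Hq\psi_m$:

$$ Hq\,\psi_m = H \sum_k q(km)\,\psi_k = \sum_k q(km)\, H\psi_k = \sum_k q(km)\, E_k\, \psi_k, $$

where $E_k$ is the characteristic energy for $\psi_k$. Similarly,

$$ qH\,\psi_m = q\,E_m\,\psi_m = E_m\, q\,\psi_m. $$

Now the value of the integral,

$$ \int \psi_n^{*}\,(Hq - qH)\,\psi_m \, d\tau = \sum_k q(km)\, E_k \int \psi_n^{*}\,\psi_k \, d\tau - E_m \int \psi_n^{*}\, q\, \psi_m \, d\tau. $$

The numbers $q(km)$ are constant; so also $E_k$. Therefore

$$ \int \psi_n^{*}\,(Hq - qH)\,\psi_m \, d\tau = (E_n - E_m)\, q_{nm} = h\nu_{nm}\, q_{nm}. \tag{71} $$

Since $q_{nm}$ varies with the time as $e^{2\pi i \nu_{nm} t}$, we have $h\nu_{nm}\, q_{nm} = -(ih/2\pi)\,\dot{q}_{nm}$.
The matrices defined by Schrödinger’s functions therefore satisfy the equation

$$ Hq - qH = -\frac{ih}{2\pi}\,\dot{q}. $$

Similarly it can be shown that the other equation of (70) is also consistent
with Schrödinger’s definition of matrices.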

The weaknesses and limitations of the wave theory have been already
indicated. The correspondence between that theory and Heisenberg’s is so
close, that none of its defects could be removed by working out problems in
terms of matrices. One unsatisfactory feature of the wave treatment was the
postulated equivalence of a momentum coordinate and a differential operator.
This was originally suggested by an analogy from the mathematical theory
of optics. It is interesting that the equivalence is due to a connection between
the Poisson brackets and the exchange relations. Dirac began with a generalization
of these relations as given by (28). It introduced what he first called
$q$-numbers and $c$-numbers, the former satisfying the non-commutability relations
and the latter being subject only to the properties of complex numbers
and functions. That has very much enlarged the scope of the theory. It has 
presented more generalized wave functions and the new perturbation theory 
is made applicable not only to the energy function but to other dynamical 
functions as well. 


4. PROBABILITY POSTULATES 

Certain fundamental considerations regarding the part played by proba- 
bility in quantum mechanics must be presented here. In the classical theory 
of probability some writers introduce the concept by considering the ratio of 
the equally probable favourable possibilities of an event to all the equally probable
possibilities. Those who are dissatisfied with this procedure on account
of the phrase ‘equally probable’, which implies an a priori knowledge, give the
other definition which depends upon frequencies and statistical distributions. 
In wave mechanics, when we talk of probabilities we mean statistical frequen- 
cies and not a priori probabilities. For observations interpret the probability 
in statistical terms only. Wherever the concept is used theoretical predictions 
or inferences are made exclusively for large numbers. In certain processes
if we still continue to think of individual particles that is confined to questions 
regarding the energy and momentum. As the matrices and waves are con- 
nected by the probability interpretation and as the calculus of probabilities in 
the quantum theories of matter and radiation is subject to laws different from 
the classical we give below a set of postulates centred round that concept. A 
clear statement of them has been provided by Ruark 3 and Urey. 

“(I) There always exists a function $\phi(x\,y, F_1 F_2)$ called the probability
amplitude such that

$$ \phi^{*}(x\,y, F_1 F_2)\;\phi(x\,y, F_1 F_2) = P(x\,y, F_1 F_2). \tag{72} $$ ”

Let us denote a fixed value of a certain dynamical variable $F_2$ by $y_0$ and
enquire: what is the probability that another variable $F_1$ has a value lying
between $x_0$ and $x_0 + \delta x_0$? The answer is provided by $P(x_0\,y_0, F_1 F_2)\,\delta x_0$.
Similarly, if $x_0$ is the fixed value of $F_1$ the probability that $F_2$ lies between $y_0$
and $y_0 + \delta y_0$ is $P(x_0\,y_0, F_1 F_2)\,\delta y_0$.

(II) The function $P(x\,y, F_1 F_1)$ must satisfy the conditions,

$$ P(x_0\,y_0, F_1 F_1) = 1 \ \text{ if } x_0 = y_0, \qquad = 0 \ \text{ if } x_0 \neq y_0. \tag{73} $$

For if it is definitely known that $F_1$ has a fixed value $x_0$ the probability that it
has at the same time any other value must be zero. This is certainly obvious.
Before performing a certain experiment suppose that the probability $F_1 = x_0$
is $P$. Once it is known that $F_1 = x_0$ at a certain instant, if our finding is to
have any value it is necessary that, on performing the experiment again, all
relevant disturbances being properly controlled, the probability that $F_1 = x_0$
must be a certainty. Hence the effect of a measurement is to disturb the
probability function. This fact has to be taken account of in framing the
postulates of probabilities. The function as given by (73) is of the same
type as the Kronecker delta which is generalized in quantum mechanics, for
continuous representations, into the function $\delta(x)$:

$$ \delta(x) = 0, \quad x \neq 0; \qquad \int_{x_1}^{x_2} \delta(x)\,dx = 1, \tag{74} $$

where $x_1$ and $x_2$ are any numbers so that $x_2 > 0 > x_1$.

(III) According to the classical law of combination of probabilities

$$ P(x\,z, F_1 F_3)\,dz = \int P(x\,y, F_1 F_2)\,dy\; P(y\,z, F_2 F_3)\,dz. \tag{75} $$

Given the value $x$ of $F_1$ we want to find out the probability that $F_3$ lies between
$z$ and $z + \delta z$, it being assumed that $F_2$ may have any value $y$. The
answer provided by (75) is not true in quantum mechanics. The equation
(75) is replaced by

$$ \phi(x\,z, F_1 F_3) = \int \phi(x\,y, F_1 F_2)\;\phi(y\,z, F_2 F_3)\,dy, \tag{76} $$

and the required probability is obtained from (72).

(IV) It is assumed that the probability functions are dependent exclusively
on the functional forms $F_1(p, q)$ and $F_2(p, q)$, being independent of
charges and other special properties of the mechanical systems.

It is on account of the form (76) that the wave functions can be, in many
problems, complex as functions of $x$, $y$, $z$.

Suppose that the probability of transition between two states $n \to l$ is
indicated by $|S'_{nl}|^2$. Let there be a further disturbance causing transitions
of the type $l \to m$, the probability being $|S''_{lm}|^2$. Hence the probability that
a system which was in the state $n$ initially might be in the state $m$ finally is

$$ \Big| \sum_l S'_{nl}\, S''_{lm} \Big|^2. \tag{77} $$

This probability must be distinguished from the probability

$$ \sum_l |S'_{nl}|^2\, |S''_{lm}|^2, \tag{78} $$

which would be true on the classical theory. The question now arises: what
is the difference in physical significance between (77) and (78)? If we take
(77) this probability is based on the understanding that no measurements
are made in the intermediate $l$ state. On the other hand (78) shows the effect
of an actual measurement made to find out how the transition takes place
from $n$ to $m$. Unless the intermediate state is observed the term $|S''_{lm}|^2$ has
no place in our theoretical estimate of the transition $n \to m$.
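
The difference between (77) and (78) is easily exhibited with numbers. In the sketch below the two sets of transition amplitudes are hypothetical: they are taken as random 3 x 3 unitary matrices, and the helper `random_unitary` is our own construction, not part of the theory.

```python
import numpy as np

def random_unitary(size, rng):
    """A hypothetical amplitude matrix: the Q factor of a random complex matrix."""
    z = rng.normal(size=(size, size)) + 1j * rng.normal(size=(size, size))
    unitary, _ = np.linalg.qr(z)
    return unitary

rng = np.random.default_rng(2)
S1 = random_unitary(3, rng)      # amplitudes S'(nl)
S2 = random_unitary(3, rng)      # amplitudes S''(lm)

# (77): intermediate state not observed, so amplitudes are added and then squared
unobserved = np.abs(S1 @ S2) ** 2

# (78): intermediate state observed, so probabilities are multiplied and added
observed = (np.abs(S1) ** 2) @ (np.abs(S2) ** 2)

interference = unobserved - observed
print(unobserved.sum(axis=1), observed.sum(axis=1))
print(np.abs(interference).max())
```

Both rules give total probability unity for each initial state n, but the individual transition probabilities differ by the interference terms, which disappear only when the intermediate state is observed.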
VI STATES AND OBSERVABLES


1. A Mathematical Analogy. 2. The Calculus of Operators. 3. The
Postulates and Principles of Dirac’s Theory. 4. The Uncertainty Principle.
5. The Electron Spin.

1. A MATHEMATICAL ANALOGY 

The quantum mechanics as expounded by Dirac is a theory of states and
observables founded on the concept of probability. By way of introducing 
the new philosophy a mathematical analogy is presented here. When the 
purpose is served the analogy must not be stretched further. Moreover once 
a good acquaintance is cultivated with the basic terms and processes of the 
theory it is no longer necessary to consider any analogies since the formal 
mathematical structure is itself extremely simple and beautiful. 

Suppose we have to describe an ellipse. In Cartesian co-ordinates it may
be given as

$$ ax^2 + 2hxy + by^2 = 1, \qquad ab - h^2 > 0, \tag{1} $$

taking the origin at the centre. In polar co-ordinates every point on the ellipse
becomes known when the value of the radius vector $r$ is given for every $\theta$.
There exists a different unit vector for each value of $\theta$ if $0 \le \theta < 2\pi$.

Let us call each of the vectors a state. Corresponding to every direction in
the plane of the ellipse there is a state; and corresponding to each state there
is a magnitude $r$ which we call the value of the observable (ellipse) for that
state. When we know the values of the observable for all possible states 
the ellipse becomes completely determined. 

Fortunately all this is not needed for a complete knowledge of the ellipse. 
It is enough to know the directions of the principal axes and corresponding 
values of the observable. The ellipse can be constructed with this much in- 
formation only. The four directions of the principal axes give four eigen- 
states and the corresponding values of r give the eigenvalues. Sufficient in- 
formation of the ellipse is provided by the eigenstates and the eigenvalues. 

By a transformation of axes,

$$ x = X\cos\theta - Y\sin\theta, \qquad y = X\sin\theta + Y\cos\theta, \tag{2} $$

we have from (1)

$$ \frac{X^2}{A^2} + \frac{Y^2}{B^2} = 1, \tag{1'} $$

where

$$ A^{-2} = a\cos^2\theta + 2h\sin\theta\cos\theta + b\sin^2\theta, \tag{3a} $$

$$ B^{-2} = a\sin^2\theta - 2h\sin\theta\cos\theta + b\cos^2\theta, \tag{3b} $$

$$ 0 = (b - a)\sin\theta\cos\theta + h\cos 2\theta. \tag{3c} $$

The form (1') gives directly the eigenstates which are four unit vectors
parallel to $OX$ and $OY$ and the eigenvalues $A$ and $B$.

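Between (1) and the matrix

$$ \begin{pmatrix} a & h \\ h & b \end{pmatrix} \tag{4} $$

a one-to-one correspondence can be established at once. If the equation (1)
represents a class of ellipses distinguished by different values of $a$, $h$, $b$
the matrix (4) can also be treated as an efficient representative of the same
class of ellipses. The operation of transformation (2) gives the new matrix

$$ \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}
\begin{pmatrix} a & h \\ h & b \end{pmatrix}
\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix} $$

and this is equal to

$$ \begin{pmatrix}
a\cos^2\theta + 2h\sin\theta\cos\theta + b\sin^2\theta & (b - a)\sin\theta\cos\theta + h\cos 2\theta \\
(b - a)\sin\theta\cos\theta + h\cos 2\theta & a\sin^2\theta - 2h\sin\theta\cos\theta + b\cos^2\theta
\end{pmatrix}. \tag{6} $$

Hence when the conditions (3) are fulfilled, we have the diagonal matrix

$$ \begin{pmatrix} A^{-2} & 0 \\ 0 & B^{-2} \end{pmatrix} \tag{4'} $$

as a representative of the observable. If we denote the matrix (4) by $\alpha$ and
the matrices

$$ \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}, \qquad
\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix} $$

by $S^{-1}$ and $S$ it will be noted that

$$ S^{-1} S = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}
\begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}
= \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}; $$

also the matrix (6) is equivalent to $S^{-1} \alpha S$. The condition (3c) diagonalizes
the matrix giving the form (4') corresponding to (1'). The eigenstates
and eigenvalues are given by the set of equations

$$ (a - \lambda)x + hy = 0, \qquad hx + (b - \lambda)y = 0, \tag{7} $$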

which express the conditions for $r$ to be a maximum or a minimum. Eliminating
$x$ and $y$ we get the determinantal equation

$$ \begin{vmatrix} a - \lambda & h \\ h & b - \lambda \end{vmatrix} = 0 \tag{8} $$

of which the roots are $\lambda = A^{-2}, B^{-2}$. When these values of $\lambda$ are substituted
in (7) the eigenstates follow from the ratio $y/x$.

In the above we have shown how an observable such as an ellipse can be
represented by a matrix. The significance of diagonalizing the matrix is
pointed out in the notation of transformation of axes. The meanings of eigenvalues
and eigenstates are made explicitly clear. The latter are represented
by mutually perpendicular vectors. When two or more eigenstates correspond
to the same eigenvalue there is said to be a degeneracy. $A$ as well as $B$ corresponds
to two different eigenstates and thus the concept of degeneracy is illustrated
in the case of the ellipse.
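
The reduction of the ellipse to its principal axes may be followed numerically. This is a sketch only: the coefficients a, h, b are hypothetical values satisfying ab - h² > 0, and the angle is obtained from the condition (3c) written as tan 2θ = 2h/(a - b).

```python
import numpy as np

a, h, b = 2.0, 0.5, 1.0                      # hypothetical coefficients with ab - h^2 > 0
alpha = np.array([[a, h], [h, b]])           # matrix (4)

# Condition (3c): (b - a) sin(theta) cos(theta) + h cos(2 theta) = 0
theta = 0.5 * np.arctan2(2 * h, a - b)
c, s = np.cos(theta), np.sin(theta)
S = np.array([[c, -s], [s, c]])
S_inv = S.T                                   # for a rotation the inverse is the transpose

diagonal = S_inv @ alpha @ S                  # should take the form (4'): diag(A^-2, B^-2)
A = diagonal[0, 0] ** -0.5                    # semi-axes, the eigenvalues of the observable
B = diagonal[1, 1] ** -0.5

roots = np.linalg.eigvalsh(alpha)             # roots of the determinantal equation (8)
print(diagonal, A, B, roots)
```

The off-diagonal terms of the transformed matrix should vanish and its diagonal terms should coincide with the two roots of (8).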

Suppose we have to consider another observable, also an ellipse. Call
it $\alpha'$ and let the matrix

$$ \begin{pmatrix} a' & h' \\ h' & b' \end{pmatrix} \tag{9} $$

stand for it. The states of $\alpha'$ can be represented in terms of two eigenstates,
$\psi_A$ and $\psi_B$ of $\alpha$ by an equation,

$$ \psi = \psi_A \cos\theta + \psi_B \sin\theta \tag{10} $$

which is the usual equation for expressing one vector in terms of two perpendicular
vectors in the same plane. Now the most important question is
whether by a suitable transformation of axes both the observables can be put
in the form (1') or (4'). Obviously this is possible only if the two ellipses
have the same principal axes. When the two observables are simultaneously
in the diagonal form

$$ \begin{pmatrix} A^{-2} & 0 \\ 0 & B^{-2} \end{pmatrix}
\begin{pmatrix} A'^{-2} & 0 \\ 0 & B'^{-2} \end{pmatrix}
= \begin{pmatrix} A'^{-2} & 0 \\ 0 & B'^{-2} \end{pmatrix}
\begin{pmatrix} A^{-2} & 0 \\ 0 & B^{-2} \end{pmatrix} $$

or

$$ S^{-1} \alpha S \cdot S^{-1} \alpha' S = S^{-1} \alpha' S \cdot S^{-1} \alpha S $$

or

$$ S^{-1} \alpha \alpha' S = S^{-1} \alpha' \alpha S. $$

Hence

$$ \alpha \alpha' = \alpha' \alpha. \tag{11} $$

Two observables whose matrices commute are called compatible. Their analogues
are ellipses whose principal axes are common. It is easy to trace the
steps backward and prove that when the condition (11) does not hold good
the two ellipses cannot be simultaneously expressed in the form (1'). For
non-compatible observables $\alpha$ and $\alpha'$, (11) is not true but they can be shown
to be functions of some fundamental observables for which Dirac has postulated
relations of the form

$$ pq - qp = -\frac{ih}{2\pi}. $$

Actually in the quantum theory the space-time co-ordinates themselves
along with the other dynamical variables are the observables and the space in
which the observables are represented is a Hermitian space of an infinite number
of dimensions. The analytical operations of this complex space which
give Dirac’s theory its distinctive features have very few analogues among
those of the Euclidean space. It is not advantageous therefore to pursue the
analogy further.
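
The connection between common principal axes and the condition (11) can be checked on particular ellipses. The sketch below is illustrative: the semi-axes and angles are hypothetical, and the helper `ellipse_matrix` builds the matrix of an ellipse from its semi-axes and the inclination of its principal axes.

```python
import numpy as np

def ellipse_matrix(A, B, theta):
    """Matrix (a h; h b) of an ellipse with semi-axes A, B and axes rotated by theta."""
    c, s = np.cos(theta), np.sin(theta)
    S = np.array([[c, -s], [s, c]])
    return S @ np.diag([A**-2.0, B**-2.0]) @ S.T

def commutator(m1, m2):
    return m1 @ m2 - m2 @ m1

alpha = ellipse_matrix(2.0, 1.0, 0.3)
same_axes = ellipse_matrix(3.0, 0.5, 0.3)     # principal axes common with alpha
tilted = ellipse_matrix(3.0, 0.5, 0.9)        # principal axes inclined to those of alpha

compatible = np.abs(commutator(alpha, same_axes)).max()
incompatible = np.abs(commutator(alpha, tilted)).max()
print(compatible, incompatible)
```

The first number should vanish to rounding error, while the second is of the order of unity, so that the two ellipses with inclined axes cannot both be brought to the diagonal form by one rotation.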


2. THE CALCULUS OF OPERATORS 

The mathematical theory underlying the quantum mechanical generalizations
by Dirac is very often described as a calculus of operators. An operator
is a rule for replacing a function by a function. In Dirac’s theory the
functions on which the operators act are wave functions, $\psi$ and $\psi^{*}$. They do
not represent waves but as they are of the same class as Schrödinger’s function
this name is used for them. The operators having a physical significance are
the observables which are so called because the numerical values that they 
represent can be identified with the results of measurement. 

Let us denote an operator by $T$. Suppose

$$ T f(x) = g(x), $$

that is, the result of the operation is to convert $f(x)$ into $g(x)$. In particular,
let $g(x)$ be the derivative of $f(x)$. If $S$ is any other operator which stands
for multiplication by $x$,

$$ ST f(x) = S f'(x) = x f'(x). $$

Obviously, the operation $ST$ is not the same as $TS$. For

$$ TS f(x) = T\,x f(x) = f(x) + x f'(x). $$

Hence

$$ (TS - ST)\, f(x) = f(x), $$

or

$$ TS - ST = 1. $$
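
The relation TS - ST = 1 may be tried on any polynomial. The following sketch uses NumPy's `Polynomial` class to represent the functions; the test function is a hypothetical choice and the names `T` and `S` simply mirror the operators of the text.

```python
from numpy.polynomial import Polynomial

def T(f):
    """Differentiation with respect to x."""
    return f.deriv()

def S(f):
    """Multiplication by x."""
    return Polynomial([0.0, 1.0]) * f

f = Polynomial([1.0, -2.0, 0.0, 4.0])       # hypothetical test function 1 - 2x + 4x^3

commutator_on_f = T(S(f)) - S(T(f))          # (TS - ST) f
print(commutator_on_f.coef, f.coef)
```

The two coefficient arrays should be identical, whatever polynomial is chosen for f.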

This introduces the simplest of all operators, the identity. Consider now an
operator $A$ which transforms $f(x)$ into $g(y)$

$$ A f(x) = g(y). \tag{12} $$

The equation may be of the form

$$ g(y) = \int A(x, y)\, f(x)\, dx \tag{13} $$

or if $x$ and $y$ each take $n$ discrete values

$$ g(y) = \sum_{x=1}^{n} a(yx)\, f(x). \tag{13'} $$

It is well known that the transformations of this kind form a group. [The
reader is introduced to the abstract idea of a group in the discussion on force.]
So if $A$ and $B$ are two operators acting in a specific order on any function
the result is the same as that of a certain operator of the same group. If

$$ h(z) = \sum_y b(zy)\, g(y), $$

then

$$ h(z) = \sum_y \sum_x b(zy)\, a(yx)\, f(x) = \sum_x c(zx)\, f(x). $$

Hence the matrix law of multiplication is satisfied by $A$, $B$, $C$; for

$$ c(zx) = \sum_y b(zy)\, a(yx). \tag{14} $$

For a full working knowledge of any operator $A$ it is enough to find
out what vector $f(x)$ is left by it unchanged in direction. The linear transformation
associated with $A$ gives us a matrix. The characteristic vectors
$f(x)$ which remain invariant are given by

$$ A f = a f \tag{15} $$

or

$$ \sum_x a(yx)\, f(x) = a\, f(y). \tag{16} $$

Since $x$ and $y$ take only integral values from 1 to $n$, the condition that
(16) may have a non-trivial solution is, in the usual notation,

$$ \left| a_{rs} - a\,\delta_{rs} \right| = 0. \tag{17} $$

The roots are the eigenvalues of A ; for each root we get a vector f(x) from 
the homogeneous equations. When all the roots are unequal there are n 
orthogonal vectors defined by the roots. If two or more roots are equal we 
can still construct a set of n mutually orthogonal vectors. But the eigenvalues 
associated with some of the directions are equal. 

The vectors f(x) and g(y) represent physical realities independent of the 
co-ordinate system chosen. Suppose therefore that in a new system the 
vectors are represented by F(x) and G{y). The law of transformation which 
was originally 

$$ A f(x) = g(y) $$

now becomes

$$ A' F(x) = G(y). \tag{17} $$

What is the connection between $A$ and $A'$? To answer this question suppose
that the two co-ordinate systems are connected by a transformation $T$ so that

$$ T f(x) = F(x) \quad\text{or}\quad f(x) = T^{-1} F(x); \tag{18} $$

and

$$ T g(y) = G(y). $$

Therefore

$$ A T^{-1} F(x) = g(y) \quad\text{or}\quad T A T^{-1} F(x) = G(y). $$

Hence

$$ A' = T A T^{-1}. \tag{19} $$

Suppose that the matrix $A$ is referred to its characteristic vectors as axes.
Let the $n$ vectors be given by $\psi(xu)$ where $u = 1, 2, \dots, n$. In the
new co-ordinate system they will be described by the functions $\psi'(yu)$:

$$ \psi'(yu) = \sum_x t_{yx}\, \psi(xu) \tag{20} $$

where $t_{yx}$ is the matrix of the transformation. Since

$$ \psi'(yu) = \delta_{yu} \tag{21} $$

it follows that $t^{-1}_{xu} = \psi(xu)$.

Hence in the new co-ordinate system $A$ is replaced by $A'$:

$$ a'_{yu} = \sum_{x, x'} t_{yx}\, a_{xx'}\, t^{-1}_{x'u} = \sum_x t_{yx}\, a_u\, \psi(xu) = a_u\, \delta_{yu}. \tag{22} $$

The new matrix is diagonal and its non-zero members are the eigenvalues of
$A$.

Usually the components of the vectors $f(x)$ are complex numbers. Hence
the length of the vector is $L$ so that

$$ L^2 = \sum_x f(x)\, f^{*}(x). \tag{23} $$

The condition for the invariance of L requires the transformations to be 
unitary. 

In Dirac’s theory the dynamical state of a system is represented by a
vector $\psi$. But the state has no longer the same meaning as in the classical
theory. We are accustomed to talk about the state of a system at an instant
as given by the values of certain variables. In quantum mechanics what is
meant by the state is the familiar vector which by itself has not a physical
meaning. But we can construct the expressions of probability current and
density from $\psi$. All the mechanical possibilities that are inherent in the
system are brought out by these expressions and also by the integrals

$$ \int \psi_m^{*}\, F\, \psi_n \, d\tau $$

where $F$ is any observable and $\psi_m$ and $\psi_n$ are the same or two different states.
As the state vector $\psi$ has complex components the whole theory requires two
separate conjugate spaces in which the conjugate imaginaries $\psi$ and $\psi^{*}$ are
defined. These spaces are Hermitian and they may be of infinitely many
dimensions. The operators simply play the part of relations of connection
for two state vectors. By the normalization of a vector is meant that the
length is unity. For one variable, the condition of normalization is

$$ \int_{-\infty}^{\infty} \psi^{*}(x)\, \psi(x)\, dx = 1. $$

Suppose that a dynamical system is given by the normalized state vector
$\psi(x)$, so that $\psi^{*}(x)\,\psi(x)\,dx$ is the probability that the system is between $x$
and $x + dx$. If we want to know the probabilities of the various values of
energy it is necessary to change the co-ordinate system so that the new principal
axes are those of energy. Accordingly a transformation function $t(W, x)$
must be found so that the new state vector is given by

$$ \psi(W) = \int t(W, x)\, \psi(x)\, dx. \tag{24} $$

From the analogue of (21) for continuous $x$,

$$ t^{-1}(W, x) = \psi(x, W) \tag{25} $$

which gives us the vectors of energy with reference to the first co-ordinate
system. Obviously $\psi(x, W)$ must be the characteristic vectors of energy as
given by

$$ H\psi = W\psi, \tag{26} $$

where $W$ is a constant. (26) is in fact Schrödinger’s equation. The reciprocal
of the transformation which leads us from the state vector of position to that
of energy is the Schrödinger wave function. Both position and energy cannot
be simultaneously diagonalized on account of the observables being incompatible.
As the transformation $t(W, x)$ must be unitary, (25) is equivalent to

$$ t(W, x) = \psi^{*}(x, W) \tag{25'} $$

and

$$ \int \psi^{*}(W)\, \psi(W)\, dW = 1. \tag{27} $$

Hence the probability that the energy lies between $W$ and $W + dW$ is
$\psi^{*}(W)\,\psi(W)\,dW$. This is the answer to the question: if $\psi^{*}(x)\,\psi(x)\,dx$ is the
probability that the system lies between $x$ and $x + dx$ what is the probability that
its energy lies between $W$ and $W + dW$?

A similar question may be put regarding the momentum also. In this
case in place of (26) we have

$$ p\,\psi = p_0\,\psi, \tag{28} $$

$p_0$ being the eigenvalue of $p$ and $\psi$ the state vector for momentum. In view
of the exchange relations between $p$ and $x$ we may write (28) as

$$ -\frac{ih}{2\pi}\,\frac{\partial \psi}{\partial x} = p_0\,\psi, \tag{29} $$

and obtain, using the normalizing factor,

$$ \psi(p_0, x) = e^{2\pi i p_0 x / h}\,(1/h)^{1/2}. \tag{30} $$

The momentum vector $\psi(p_0)$ in the new co-ordinate system is

$$ \psi(p_0) = \int \psi^{*}(p_0, x)\, \psi(x)\, dx = h^{-1/2} \int e^{-2\pi i p_0 x / h}\, \psi(x)\, dx. \tag{31} $$

So the probability that the momentum lies between $p_0$ and $p_0 + \delta p_0$ is
$\psi^{*}(p_0)\,\psi(p_0)\,\delta p_0$ when the probability that the position lies between $x$ and
$x + \delta x$ is $\psi^{*}(x)\,\psi(x)\,\delta x$.
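
The passage from the position vector to the momentum vector by (31) can be carried out numerically for a simple wave packet. The sketch below makes several assumptions that are ours and not part of the theory: h is set to unity, the state is a Gaussian packet with hypothetical width and mean momentum, and the integrals are replaced by plain Riemann sums on uniform grids.

```python
import numpy as np

h = 1.0                                         # hypothetical unit for Planck's constant
x = np.linspace(-10.0, 10.0, 2001)
dx = x[1] - x[0]

sigma, p_mean = 1.0, 0.5                        # hypothetical packet parameters
psi_x = (2 * np.pi * sigma**2) ** -0.25 * np.exp(
    -x**2 / (4 * sigma**2) + 2j * np.pi * p_mean * x / h)

p0 = np.linspace(-0.5, 1.5, 501)
dp = p0[1] - p0[0]

# (31): psi(p0) = h^(-1/2) * integral of exp(-2 pi i p0 x / h) psi(x) dx, as a Riemann sum
kernel = np.exp(-2j * np.pi * np.outer(p0, x) / h) / np.sqrt(h)
psi_p = (kernel @ psi_x) * dx

norm_x = np.sum(np.abs(psi_x) ** 2) * dx        # total probability in position
norm_p = np.sum(np.abs(psi_p) ** 2) * dp        # total probability in momentum
mean_p = np.sum(p0 * np.abs(psi_p) ** 2) * dp
print(norm_x, norm_p, mean_p)
```

Both totals should be close to unity, reflecting the unitary character of the transformation, and the mean momentum should come out close to the value built into the packet.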

The wave functions of Dirac's theory provide the probability amplitudes
as required by the first postulate of the calculus of probabilities. If we look
into the remaining postulates as stated in Chap. V they will be found to be
fulfilled by the transformation functions. Any two observables which are
not compatible with each other are symmetrically connected by a
transformation which leads to the required probability function, symmetrical
with respect to the two. Regarding the second postulate we know from (24)
and (25) that

$$\int \psi^{\times}(x, W)\,\psi(x, W_1)\,dx = \psi(W) = \delta_{W W_1} \tag{32}$$

if $\psi(x, W_1)$ is a solution of Schrödinger's equation (26). In the most general
case the relation analogous to (32) can similarly be proved. The fact that
the probability amplitudes are linear transformation functions forming a group
accounts for the third postulate. As the determination of these
transformation functions is independent of the total charge or the form of the
energy functions and such characteristics, the fourth postulate also is
automatically satisfied.

To find out the probability relation between the momentum and energy
we have to restate the operational equation

$$H\psi = W\psi$$

differently. It is assumed that originally the state vector for momentum is
known. Hence $\psi$ is expected to be a function of $p$ and $W$. The operator $H$
is therefore expressed as

$$H(p, q) = H\!\left(p,\ \frac{ih}{2\pi}\frac{\partial}{\partial p}\right). \tag{33}$$

This is permissible since it is consistent with the relation

$$pq - qp = -\frac{ih}{2\pi}.$$

The eigenfunctions of the operator (33) are similar to the wave functions
which satisfy the Schrödinger equation.

3. THE POSTULATES AND PRINCIPLES OF DIRAC'S THEORY

Having acquainted the reader with the terminology and procedure of
Dirac's theory we may now state, at the risk of some repetition, the
fundamental postulates on which quantum mechanics is based. The theory divides
itself into two parts. In the first part one deals with the connection between
the components of an atomic system at any instant and, in the second, with
the progress of motion. Historically the first fundamental postulate is that
concerning the equality of a commutator with the corresponding Poisson
bracket of the classical theory. If $F_1(p, q)$ and $F_2(p, q)$ are two classical
functions,

$$F_1F_2 - F_2F_1 = \frac{ih}{2\pi}\,[F_1, F_2]. \tag{A}$$

For the first part of the theory the classical relations of the Poisson brackets
provide the quantum conditions

$$pq - qp = -\frac{ih}{2\pi}, \text{ etc.}$$

As regards the other part the Hamiltonian equations suggest quantum
relations of the type

$$\frac{ih}{2\pi}\,\dot{F}_1 = F_1H - HF_1, \text{ etc.}$$

The analogues of the classical dynamical variables are operators. They are
called observables. In addition to those which originate in classical
mechanics there are other observables such as the electron spin and the
permutation variable which are of quantum origin. Compatible observables
commute; incompatible ones do not. The theory of states and observables is
developed in order to get measurable effects for observables comparable in the
limit with those of the classical theory. The representation of states is by
vectors in a Hermitian space satisfying the principle of superposition. It means
that the only possible connections between the states of a dynamical system
are linear as given by

$$x_1\psi_1 + x_2\psi_2 + \cdots + x_n\psi_n = 0, \tag{B}$$

where $x_1, x_2, \ldots, x_n$ are numbers and $\psi_1, \ldots, \psi_n$ are all vectors.
As the operations are all linear in the theory, if $\psi_1$ and $\psi_2$ are two
solutions of an operational equation so also is

$$c_1\psi_1 + c_2\psi_2,$$

where $c_1$ and $c_2$ are constants. Even the transformation equations of the
form

$$\psi(q) = \int t(q, p)\,\psi(p)\,dp$$

are examples of the principle of superposition. It is interesting to note that
if the relation (B) holds good at one instant it is valid for all time. The
coefficients $x_1$, $x_2$, etc. do not change but the states may change with time
without violating the original form of superposition. Two features of this
principle are noteworthy. If the same state is superposed on itself no new
state results. Secondly any state that is obtained by superposition from two
or more different states can be decomposed into any one or more of these
states for the prediction of experimental results. This is permitted by the
probability interpretation that is attached to the states. What is really novel
about the superposition principle may be put this way. If $\psi$ is a state
intermediate between $\psi_1$ and $\psi_2$, it is not true to say that the result of
an observation on $\psi$ would be intermediate between those of $\psi_1$ and
$\psi_2$. For the vectors, by themselves, have no physical meaning. But it is
right to say that the probability associated with $\psi$ lies between the
probabilities for $\psi_1$ and $\psi_2$. In the exposition as given by Dirac the
following physical hypotheses enter.

"In the special case when the result of a particular observation made on
the system in a particular state is with certainty one particular number, $a$
say, then the Hermitian operator, $\alpha$ say, representing the observable that is
measured and the $\psi$-vector, $\psi_a$ say, representing the state are connected by
the equation

$$\alpha\,\psi_a = a\,\psi_a. \tag{C}$$

Conversely, if this equation holds, a measurement of the observable represented
by $\alpha$ made on the system in the state represented by $\psi_a$ is certain to lead to
the result $a$."

(Dirac: 'Quantum Mechanics' (1935), p. 30)

"If the measurement of the observable represented by $\alpha$, for the system
in the state represented by $\psi_x$, is made a large number of times, the average
of all the results obtained will be

$$\phi_x\,\alpha\,\psi_x \tag{D}$$

provided $\phi_x$ and $\psi_x$ are normalized."

(Dirac: 'Q.M.' (1935) p. 43)

In Dirac's statement of (D) $\phi_x$ stands for $\psi_x^{\times}$.

If $\psi_1, \psi_2, \ldots$ etc. are orthogonal normalized functions let

$$\psi_r^{\times}\,\psi_s = A + iB. \tag{34}$$

Then

$$\psi_s^{\times}\,\psi_r = A - iB. \tag{34'}$$

This is a mathematical postulate of the theory. These relations are used to
define the conjugate observable $\alpha^{\times}$. It is given by

$$\psi_r^{\times}\,\alpha^{\times}\,\psi_s = C - iD, \tag{35}$$

if

$$\psi_s^{\times}\,\alpha\,\psi_r = C + iD. \tag{35'}$$

In the above $A$, $B$, $C$, $D$ stand for real numbers. The complex observables
for which the eigenvalues are complex numbers have no physical significance.
In quantum mechanics we deal with real observables only.

Let $\xi_1, \xi_2, \ldots, \xi_n$ be all the independent observables of a system
every two of which commute with each other. Then it is possible to put them all
simultaneously in the diagonal form. It can now be shown that for any
other observable $\alpha$ to be diagonalized in the $\xi$-representation it is
necessary and sufficient that $\alpha$ be a function of the $\xi$'s. The
transformation theory connects up the $\xi$-state vectors with the $\eta$-state
vectors, where $\eta_1, \eta_2, \ldots, \eta_n$ is another complete set of independent
but compatible observables. The connecting equation is

$$\psi(\eta') = \sum_{\xi'} \psi(\xi')\,(\xi'|\eta') \tag{36}$$

or if $\xi$ are continuous,*

$$\psi(\eta') = \int \psi(\xi')\,d\xi'\,(\xi'|\eta'). \tag{36'}$$

Given the state vector $\psi(\xi)$, the transformation function must be obtained by
solving the equation

$$\eta\,\psi = \eta'\,\psi. \tag{37}$$

It has been already shown in (25) that this is the reciprocal of $(\xi'|\eta')$. In
order to solve the operational equation $\eta$ must be expressed in terms of
$\xi$. This is done by means of the non-compatibility relation

$$\xi\eta - \eta\xi = \frac{ih}{2\pi}\,[\xi, \eta].$$

The physical hypotheses arising out of the transformation may now be
briefly summed up. Any normalized state vector is represented in the
$\xi$-space by components $(\xi'|\,)$ so that

$$\psi = \sum_{\xi'} \psi(\xi')\,(\xi'|\,). \tag{38}$$

Similarly in the $\eta$-space,

$$\psi = \sum_{\eta'} \psi(\eta')\,(\eta'|\,) \tag{38'}$$

so that from (36)

$$(\eta'|\,) = \sum_{\xi'} (\eta'|\xi')\,(\xi'|\,), \tag{39}$$

where

$$\sum_{\xi'} (\eta'|\xi')\,(\xi'|\eta'') = \delta_{\eta'\eta''}. \tag{40}$$

Similar equations follow when $\xi'$ and $\eta'$ are continuous variables. The
probability that a system when referred to the $\xi$'s is in the state $\xi'$ is
given by

$$P(\xi') = |(\xi'|\,)|^2. \tag{E}$$

Similarly, the probability that the system is in the state $\eta'$ is

$$P(\eta') = |(\eta'|\,)|^2 \tag{E'}$$

when it is referred to the $\eta$'s. Moreover, on account of the reciprocal nature
of the two transformation functions as given by (40) we have the theorem of
reciprocity:

"The probability of the $\xi$'s having the values $\xi'$ for the state for which
the $\eta$'s certainly have the values $\eta'$ is equal to the probability of the
$\eta$'s having the values $\eta'$ for the state for which the $\xi$'s certainly have
the values $\xi'$."

(Dirac: 'Q.M.' (1935) p. 66)

$$|(\xi'|\eta')|^2 = |(\eta'|\xi')|^2. \tag{F}$$

(E) leads to the definition of transition probabilities and expectation values.
The vectors $\psi$ in (38) being normalized,

$$\sum_{\xi'} |(\xi'|\,)|^2 = 1.$$

This is as it should be if the probability interpretation of (E) is to remain
valid.

* Here $\eta'$, $\eta''$ etc. represent the eigenvalues.

Suppose that an observable $\alpha$ operating on $\psi(\xi'')$ generates a function
$\alpha\,\psi(\xi'')$. Let

$$\alpha\,\psi(\xi'') = \sum_{\xi'} \psi(\xi')\,(\xi'|\,). \tag{41}$$

Then

$$\psi^{\times}(\xi')\,\alpha\,\psi(\xi'') = (\xi'|\,). \tag{42}$$

The left-hand side is usually expressed as $(\xi'|\alpha|\xi'')$ so that

$$\alpha\,\psi(\xi'') = \sum_{\xi'} \psi(\xi')\,(\xi'|\alpha|\xi''). \tag{43}$$

Also,

$$P = |(\xi'|\alpha|\xi'')|^2. \tag{44}$$

$P$ is the probability that a system which was originally in the eigenstate
$(\xi'')$ goes into the state $(\xi')$ as a result of the observation $\alpha$ being
made. For a real observable

$$\psi^{\times}(\xi')\,\alpha\,\psi(\xi'') \quad\text{and}\quad \psi^{\times}(\xi'')\,\alpha\,\psi(\xi')$$

are conjugate imaginaries. Hence $\xi'$ and $\xi''$ may also interchange their
places in (44) without changing the value of $P$. The expectation value for any
particular state is the same as given by (D).

In fine, we will consider how the expectation values⁴ change with time.
Consider any operator $F(q, p)$ expressed in the $q$-space. For a particular
state vector $\psi$ the corresponding expectation value is

$$\bar{F} = \int \psi^{\times}\,F\!\left(q,\ \frac{h}{2\pi i}\frac{\partial}{\partial q}\right)\psi\,dq. \tag{45}$$

Hence

$$\frac{d\bar{F}}{dt} = \int\left[\frac{\partial\psi^{\times}}{\partial t}\,F\psi
  + \psi^{\times}F\,\frac{\partial\psi}{\partial t}\right]dq.$$

Since

$$\frac{ih}{2\pi}\,\frac{\partial\psi}{\partial t} = H\psi,$$

$$\frac{d\bar{F}}{dt} = \frac{2\pi i}{h}\int\left[(H\psi)^{\times}F\psi - \psi^{\times}FH\psi\right]dq
  = -\frac{2\pi i}{h}\int \psi^{\times}(FH - HF)\,\psi\,dq
  = \int \psi^{\times}\,[F, H]\,\psi\,dq, \tag{46}$$

in the notation of the Poisson bracket. Thus the mean value of the Poisson
bracket $[F, H]$ equals $d\bar{F}/dt$. This is the analogue of the classical result

$$\frac{dF}{dt} = [F, H].$$

The expectation values of the $q$'s and $p$'s follow the Hamiltonian equations in
the limiting case when $h = 0$. From (46) it is obvious that

$$\frac{d\bar{H}}{dt} = 0.$$

If we put $F = H^n$, we find also that

$$\frac{d\,\overline{H^n}}{dt} = 0. \tag{47}$$

As (47) is true for all values of $n$ it gives the analogue of the principle of
conservation of energy. If $F$ is any classical integral such as that of
conservation of momentum, it is true that

$$[F, H] = 0. \tag{48}$$

Hence

$$\frac{d\bar{F}}{dt} = 0 \tag{49}$$

and the quantum-mechanical analogue is presented by the constancy of the
expectation value.
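
The relation (46) and the constancy statements (47) to (49) can be tried out on a small numerical example. What follows is only a sketch and is not taken from the lectures: it assumes a hypothetical two-level system written in the representation in which $H$ is diagonal, sets $h/2\pi = 1$, and uses illustrative names (`F_CONSERVED`, `F_OTHER`, `bracket`, `rate`) and numerical values chosen purely for the demonstration.

```python
import cmath

HBAR = 1.0                      # h / 2 pi in arbitrary units (assumption)
ENERGIES = [1.0, 2.5]           # eigenvalues of H, illustrative values
AMPLITUDES = [0.6, 0.8]         # components at t = 0; 0.36 + 0.64 = 1

H = [[ENERGIES[0], 0.0], [0.0, ENERGIES[1]]]
F_CONSERVED = [[3.0, 0.0], [0.0, -1.0]]   # diagonal like H, so FH - HF = 0
F_OTHER = [[0.0, 1.0], [1.0, 0.0]]        # does not commute with H


def state(t):
    """State vector at time t; each energy component only acquires a phase."""
    return [c * cmath.exp(-1j * e * t / HBAR)
            for c, e in zip(AMPLITUDES, ENERGIES)]


def mean(op, psi):
    """Expectation value psi^x op psi of a 2 x 2 operator."""
    total = sum(psi[i].conjugate() * op[i][j] * psi[j]
                for i in range(2) for j in range(2))
    return total.real


def bracket(f, h):
    """Quantum analogue of the Poisson bracket: (FH - HF) / (i hbar)."""
    fh = [[sum(f[i][k] * h[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    hf = [[sum(h[i][k] * f[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    return [[(fh[i][j] - hf[i][j]) / (1j * HBAR) for j in range(2)]
            for i in range(2)]


def rate(op, t, dt=1e-5):
    """Central-difference estimate of d(mean)/dt."""
    return (mean(op, state(t + dt)) - mean(op, state(t - dt))) / (2 * dt)


for t in (0.0, 0.7, 1.9):
    # Columns: t, mean of H, mean of F_CONSERVED, d(mean F_OTHER)/dt,
    # and the mean of the bracket [F_OTHER, H], which should match it.
    print(t, mean(H, state(t)), mean(F_CONSERVED, state(t)),
          rate(F_OTHER, t), mean(bracket(F_OTHER, H), state(t)))
```

In this sketch the means of `H` and `F_CONSERVED` stay fixed in time, while the rate of change of the mean of `F_OTHER` agrees with the mean of the bracket, as (46) requires.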


4. THE UNCERTAINTY PRINCIPLE 

There are two aspects of the uncertainty principle; one is purely
experimental while the other is theoretical. The experimental aspect is not new.
It has been well known that the most delicate operations of experimentation
can disturb a system so much that an unknown measure of uncertainty
vitiates the final results. The frequency theory of probability is, in fact, based
upon this hypothesis of experimental uncertainty. The theoretical aspect is
new, but not as new to the metaphysician as to the physicist.⁷ Consider, as an
example, the hackneyed question of the position and velocity of a particle.
From the experimental point of view the uncertainty principle means that
both the position and the velocity cannot be accurately observed at any instant
as the observation of one disturbs the other. If we take the purely theoretical
standpoint the question is whether the concept of velocity at an instant has
any physical meaning. Average velocity has an observational significance
but 'velocity at an instant' is a logical fiction.

As the empire of known facts grows in size the physical hypotheses are
contradicted and the mathematical postulates cannot be reconciled with the
new logic of phenomena. Heisenberg⁵ was the first to show that the tacit
assumption regarding the possibility of determining at once both the position
and the velocity of a particle cannot be true in atomic physics. The
corresponding modification of the theoretical structure was brought out by Born
when he gave the probability interpretation of Schrödinger's wave function.
Dirac's theory of incompatible observables is a full-fledged exposition of the
uncertainty principle so far as it relates to quantum mechanics. The method
of the second quantization for wave fields of radiation similarly establishes
uncertainty relations for the measurable quantities of a radiation field.

The first intimation of the microscopic natural phenomena being governed
by uncertainty relations came with the two duality relations,

$$E = h\nu, \qquad p = h/\lambda, \tag{50}$$

which Einstein had established for light and de Broglie had assumed for
matter. Consider a wave group of extent $\Delta x$ associated with a particle at
some instant. Let us suppose that the group is formed of wave lengths near
about $\lambda_0$. Then if $n$ wave lengths $\lambda_0$ are contained in $\Delta x$, at least
$n + 1$ waves of a neighbouring wave length $\lambda_0 - \Delta\lambda$ must be contained in
$\Delta x$. Hence

$$\Delta x = n\lambda_0 \ge (n + 1)(\lambda_0 - \Delta\lambda)$$

or

$$\frac{\Delta x\,\Delta\lambda}{\lambda_0^2} \ge 1.$$

Since $1/\lambda_0 = p/h$, $(1/\lambda_0^2)\,\Delta\lambda = \Delta p/h$ and

$$\Delta x\,\Delta p \ge h. \tag{51}$$

We may also write this in the form

$$v\,\Delta t \cdot \Delta p \ge h$$

because $\Delta x = v\,\Delta t$; also $v\,\Delta p = \Delta E$ for a particle. Therefore

$$\Delta E\,\Delta t \ge h. \tag{52}$$

From the first relation of (50) also (52) can be deduced.

The uncertainty relation for position and momentum can also be deduced
from (31). For if

$$\psi(x) = \frac{1}{(\pi a^2)^{1/4}}\; e^{-\frac{(x - x_1)^2}{2a^2}}\; e^{\frac{2\pi i p_1 x}{h}}, \tag{53}$$

$$\psi(p) = \left(\frac{4\pi a^2}{h^2}\right)^{1/4}
  e^{-\frac{2\pi^2 a^2}{h^2}(p - p_1)^2 \,-\, \frac{2\pi i}{h}(p - p_1)x_1}. \tag{54}$$

When $p$ has the mean value $p_1$ the conventional measure of uncertainty is, as
given by $|\psi(p)|^2$,

$$\frac{1}{\sqrt{2}}\;\frac{h}{2\pi a}. \tag{55}$$

Similarly when $x_1$ is the mean value of $x$ the uncertainty is $a/\sqrt{2}$. It is
clear that the product of the two uncertainty measures is of the order of $h$.
In the choice of $\psi(x)$ we are led to introduce a phase factor which does not
alter the probability $|\psi(x)|^2$ in order that the determination of $\psi(p)$ may
be simplified. This proof depends upon the form of the normalized function
$\psi(x)$. But the relation (31) can also be expressed in Dirac's notation as

$$(p'|\,) = h^{-1/2}\int e^{-2\pi i p' q'/h}\,dq'\,(q'|\,). \tag{56}$$

If $|(q'|\,)|$ is practically negligible outside a region $\Delta q$ then the variation
in wave length $h/p$ must give, by the argument which gives (51),

$$\frac{\Delta q}{h/p_0} - \frac{\Delta q}{h/(p_0 - \Delta p)} \ge 1$$

or

$$\Delta q\,\Delta p \ge h. \tag{57}$$

This is brought out in Chap. IV (4) more explicitly.
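
The statement that a wave packet and its momentum amplitude have spreads whose product is of the order of $h$ can be checked numerically. The sketch below is an illustration only and makes its own assumptions: it takes the packet of (53) to be a normalized Gaussian of width parameter $a$ carrying the phase factor $e^{2\pi i p_1 x/h}$, evaluates the transformation (31) by simple midpoint quadrature, sets $h = 1$, and uses arbitrary values for $a$, $x_1$ and $p_1$. The names `psi_x`, `psi_p`, `grid` and `spread` are hypothetical helpers.

```python
import cmath
import math

H = 1.0             # Planck's constant h, set to 1 (arbitrary unit choice)
A_WIDTH = 0.7       # width parameter a of the packet (assumed value)
X1, P1 = 0.3, 2.0   # mean position and mean momentum (assumed values)


def psi_x(x):
    """Normalized Gaussian packet with a phase factor exp(2 pi i p1 x / h)."""
    norm = (math.pi * A_WIDTH ** 2) ** -0.25
    envelope = math.exp(-(x - X1) ** 2 / (2 * A_WIDTH ** 2))
    return norm * envelope * cmath.exp(2j * math.pi * P1 * x / H)


def grid(lo, hi, n):
    """Midpoints of n equal cells on [lo, hi], and the cell width."""
    step = (hi - lo) / n
    return [lo + (k + 0.5) * step for k in range(n)], step


XS, DX = grid(X1 - 8 * A_WIDTH, X1 + 8 * A_WIDTH, 400)
PSI_X = [psi_x(x) for x in XS]


def psi_p(p):
    """Momentum amplitude from the transformation (31), by quadrature."""
    total = sum(cmath.exp(-2j * math.pi * p * x / H) * f
                for x, f in zip(XS, PSI_X))
    return total * DX / math.sqrt(H)


def spread(points, step, amplitudes):
    """Return (total probability, standard deviation) of |amplitude|^2."""
    weights = [abs(a) ** 2 * step for a in amplitudes]
    total = sum(weights)
    mean = sum(w * q for w, q in zip(weights, points)) / total
    var = sum(w * (q - mean) ** 2 for w, q in zip(weights, points)) / total
    return total, math.sqrt(var)


# Expected momentum spread, used here only to choose the momentum grid.
SIGMA_P_GUESS = H / (2 * math.pi * A_WIDTH * math.sqrt(2))
PS, DP = grid(P1 - 8 * SIGMA_P_GUESS, P1 + 8 * SIGMA_P_GUESS, 200)
PSI_P = [psi_p(p) for p in PS]

norm_x, sigma_x = spread(XS, DX, PSI_X)
norm_p, sigma_p = spread(PS, DP, PSI_P)
print("total probabilities:", norm_x, norm_p)
print("product of spreads:", sigma_x * sigma_p, " h / 4 pi:", H / (4 * math.pi))
```

With root-mean-square deviations as the measure of uncertainty, the product for this Gaussian comes out as $h/4\pi$, which is of the order of $h$ as the text states; a non-Gaussian packet would give a larger product.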


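A very general statement of the uncertainty relations can be deduced from
the calculus of operators. The expectation value of the product of an operator
with its conjugate is never negative. Using this the uncertainty relations have
been found out.⁶ If $A$ and $B$ are real observables and $\lambda$ is a real number,
the expectation value, as shown by a bar overhead, cannot be negative for
$(A + i\lambda B)(A + i\lambda B)^{\times}$.

Hence

$$\overline{(A + i\lambda B)(A + i\lambda B)^{\times}} \ge 0. \tag{58}$$

Therefore

$$\overline{A^2} + \lambda^2\,\overline{B^2} - i\lambda\,\overline{(AB - BA)} \ge 0.$$

Choosing $\lambda$ so that the left-hand side becomes a minimum we have

$$\lambda = \frac{i\,\overline{(AB - BA)}}{2\,\overline{B^2}}$$

or

$$\overline{A^2}\;\overline{B^2} \ge \tfrac{1}{4}\,\bigl|\overline{AB - BA}\bigr|^2. \tag{59}$$

Consider the observables

$$\delta A = A - \bar{A}, \qquad \delta B = B - \bar{B}.$$

Then

$$\delta A\,\delta B = \delta B\,\delta A + AB - BA.$$

Hence

$$\overline{\delta A^2}\;\overline{\delta B^2}
  \ge \tfrac{1}{4}\,\bigl|\overline{\delta A\,\delta B - \delta B\,\delta A}\bigr|^2
  = \tfrac{1}{4}\,\bigl|\overline{AB - BA}\bigr|^2. \tag{60}$$

Let $A$ and $B$ stand for $p$ and $q$; then

$$AB - BA = \frac{h}{2\pi i}$$

and

$$\overline{(\delta p)^2}\;\overline{(\delta q)^2} \ge \frac{h^2}{16\pi^2}. \tag{61}$$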
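
The operator inequality (60) holds for any pair of real observables, not only for $p$ and $q$, and it can be tested on small matrices. The following sketch is an illustration under stated assumptions rather than part of the original argument: it takes the two Pauli matrices $\sigma_x$ and $\sigma_y$ as stand-ins for $A$ and $B$, a two-component state fixed by two angles, and hypothetical helper names (`mean`, `deviation`, `both_sides`).

```python
import cmath
import math

SIGMA_X = [[0, 1], [1, 0]]        # plays the part of the observable A
SIGMA_Y = [[0, -1j], [1j, 0]]     # plays the part of the observable B


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mean(op, psi):
    """Expectation value psi^x op psi for a normalized state."""
    n = len(psi)
    return sum(psi[i].conjugate() * op[i][j] * psi[j]
               for i in range(n) for j in range(n))


def deviation(op, psi):
    """delta A = A - mean(A), the observable measured from its mean."""
    m = mean(op, psi).real
    n = len(op)
    return [[op[i][j] - (m if i == j else 0) for j in range(n)]
            for i in range(n)]


def both_sides(theta, phi):
    """Left and right sides of (60) for the state fixed by two angles."""
    psi = [math.cos(theta / 2), cmath.exp(1j * phi) * math.sin(theta / 2)]
    d_a, d_b = deviation(SIGMA_X, psi), deviation(SIGMA_Y, psi)
    lhs = mean(matmul(d_a, d_a), psi).real * mean(matmul(d_b, d_b), psi).real
    ab, ba = matmul(SIGMA_X, SIGMA_Y), matmul(SIGMA_Y, SIGMA_X)
    comm = [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]
    rhs = 0.25 * abs(mean(comm, psi)) ** 2
    return lhs, rhs


for theta, phi in [(0.0, 0.0), (0.9, 0.4), (1.6, 2.0), (2.7, 5.1)]:
    lhs, rhs = both_sides(theta, phi)
    print(theta, phi, lhs, rhs, lhs >= rhs - 1e-12)
```

For this particular pair the two sides become equal whenever the state is an eigenstate of the third Pauli matrix, which is the analogue of a minimum-uncertainty state.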

Modern physics presents several interesting applications of this principle.
In the study of solids subjected to high pressures we are told how a limitation
on the closeness of packing arises and why solid matter would assume an
amorphous form. In the theory of spectra the principle explains the natural line
breadth. The phenomenon of the fluctuation of the electric force vector, when
there are no photons in the field, is also understood in the light of the
uncertainty relations. In quantum statistics, for the same reason, the area of
the smallest cell in the $p$-$q$ space is $h$. We will close this account with a
discussion of the angular momentum vector, $(H_x, H_y, H_z)$.

We have, for a particle in a central field of force,

$$yP_z - zP_y = H_x,$$
$$zP_x - xP_z = H_y,$$
$$xP_y - yP_x = H_z.$$





If $\dfrac{2\pi}{h}\,(H_x, H_y, H_z) = (M_x, M_y, M_z)$
the operational equations become

$$y\frac{\partial\psi}{\partial z} - z\frac{\partial\psi}{\partial y} = i M_x\,\psi,$$
$$z\frac{\partial\psi}{\partial x} - x\frac{\partial\psi}{\partial z} = i M_y\,\psi, \tag{62}$$
$$x\frac{\partial\psi}{\partial y} - y\frac{\partial\psi}{\partial x} = i M_z\,\psi.$$

These three equations are not compatible with each other unless $M_x = M_y =
M_z = 0$. Hence it is not possible to know simultaneously all the three
components of the angular momentum vector. They are not a set of compatible
observables. It can be shown by direct computation that

$$\begin{aligned}
H_xH_y - H_yH_x &= (yP_z - zP_y)(zP_x - xP_z) - (zP_x - xP_z)(yP_z - zP_y) \\
 &= yP_z\,zP_x - z^2P_yP_x - yxP_z^2 + zxP_yP_z \\
 &\quad - zyP_xP_z + xyP_z^2 + z^2P_xP_y - xP_z\,zP_y \\
 &= yP_x(P_zz - zP_z) - xP_y(P_zz - zP_z) \\
 &= (yP_x - xP_y)(P_zz - zP_z) \\
 &= -\frac{h}{2\pi i}\,H_z.
\end{aligned} \tag{63}$$

But

$$H^2H_x = H_xH^2, \qquad H^2H_y = H_yH^2, \qquad H^2H_z = H_zH^2. \tag{64}$$

Hence $H^2$ and $H_z$ can be regarded as compatible. So it is possible to know
the magnitude of the resultant and also of one of the components
simultaneously. The uncertainty principle tells us why the maximum possible value
of a component does not agree with that of $H$. The reader is reminded of
the fact that if the maximum value of $H_z$ is $Jh/2\pi$, the value of $H^2$ is
$J(J+1)\,h^2/4\pi^2$. If the maximum were the same as the value of $H$ the
knowledge of one component would be enough to know the vector, the other
components being zero. Thus all the three components $H_x$, $H_y$, $H_z$ being
known, by Heisenberg's uncertainty relations, the angles about the $x$, $y$, $z$
axes would be completely undetermined. This is only possible if the
probability distribution is spherically symmetrical, that is, if the angular
momentum is zero.

(G. Herzberg, "Atomic Spectra and Structure" (1937) p. 102).
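
The exchange relations (63) and (64) can be confirmed on an explicit matrix representation. The sketch below is illustrative and rests on assumptions not made in the text: it uses the standard 3 x 3 matrices for $J = 1$ in the representation where $H_z$ is diagonal, measures angular momentum in units of $h/2\pi$, and relies on small hypothetical helpers (`combine`, `commutator`, `max_abs`).

```python
import math

HBAR = 1.0                  # h / 2 pi, taken as the unit (assumption)
R = 1.0 / math.sqrt(2.0)

# Standard J = 1 matrices, in units of HBAR, with H_z diagonal.
H_X = [[0, R, 0], [R, 0, R], [0, R, 0]]
H_Y = [[0, -1j * R, 0], [1j * R, 0, -1j * R], [0, 1j * R, 0]]
H_Z = [[1, 0, 0], [0, 0, 0], [0, 0, -1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def combine(a, b, weight):
    """Return a + weight * b, element by element."""
    n = len(a)
    return [[a[i][j] + weight * b[i][j] for j in range(n)] for i in range(n)]


def commutator(a, b):
    return combine(matmul(a, b), matmul(b, a), -1)


def max_abs(a):
    return max(abs(x) for row in a for x in row)


H_SQUARED = combine(
    combine(matmul(H_X, H_X), matmul(H_Y, H_Y), 1), matmul(H_Z, H_Z), 1)

# (63): H_x H_y - H_y H_x = -(h / 2 pi i) H_z = i HBAR H_z
residual_63 = max_abs(combine(commutator(H_X, H_Y), H_Z, -1j * HBAR))
# (64): H^2 commutes with every component
residual_64 = max(max_abs(commutator(H_SQUARED, c)) for c in (H_X, H_Y, H_Z))

print("residual of (63):", residual_63)
print("residual of (64):", residual_64)
print("largest H_z:", max(H_Z[i][i] for i in range(3)),
      " H^2:", H_SQUARED[0][0].real)
```

In these units the largest value of $H_z$ is $J = 1$ while $H^2$ equals $J(J+1) = 2$, so the component can never exhaust the magnitude $\sqrt{2}$ of the resultant.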


5. THE ELECTRON SPIN 

The theory of generalized operators outlined above has been successfully
applied to explain atomic and molecular spectra and to various problems of
physics and chemistry. The methods have also attained considerable success
in nuclear physics. The statistics of charged particles, the arrangement of
atoms in crystals, the distributions of the electronic shells about the nucleus
and spins of the valence electrons: these are questions on which quantum
mechanics has given accurate answers as verified in the applications to
ferromagnetism, conductivity, valence, etc. It is the electron spin that happens
to play the most conspicuous part in all these phenomena. The spin hypothesis
was first suggested by Compton in 1921 and later by Goudsmit and Uhlenbeck in
1925. It led immediately to an explanation of the multiplets, the anomalous
Zeeman effect and the Paschen-Back effect as well. The Pauli principle was
formulated and it cleared up at once most of the long-standing difficulties of
the periodic table. The mystery of the Landé $g$-formula was partially solved
but the factor 2 which arises in the relationship between the electronic
magnetic moment and the angular momentum continued to be baffling in spite of
the explanation given by L. H. Thomas. Dirac's relativistic theory of the
electron gave a logical explanation of the spin and resolved the difficulty of
the factor 2.

From the relativistic form³ of the classical Hamiltonian we get for the
wave equation of a particle,

$$\left\{\frac{W}{c} - \left(m^2c^2 + p_x^2 + p_y^2 + p_z^2\right)^{1/2}\right\}\psi = 0, \tag{65}$$

in the usual notation. This equation merely takes account of the relativistic
variation of mass but has not the proper symmetry between the energy and
momentum components to give the necessary invariance. By multiplying the
operator in (65) by another, viz.,

$$\frac{W}{c} + \left(m^2c^2 + p_x^2 + p_y^2 + p_z^2\right)^{1/2},$$

it is possible to get a symmetric wave equation,

$$\left\{\frac{W^2}{c^2} - \left(m^2c^2 + p_x^2 + p_y^2 + p_z^2\right)\right\}\psi = 0. \tag{66}$$

The last equation is not permissible on the postulates of quantum mechanics
as the energy operator is not linear. Hence Dirac undertook to investigate
whether a linear equation of the form

$$\left\{\frac{W}{c} + \alpha_x p_x + \alpha_y p_y + \alpha_z p_z + \beta\right\}\psi = 0, \tag{67}$$

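invariant under the Lorentz transformation, cannot be true for an electron in
free space. The operators $\alpha$'s and $\beta$ must be independent of the energy and
momentum and also of the co-ordinates $x$, $y$, $z$, $t$. Hence they must bring out
some innate property of the electron such as the spin, if it exists.
Multiplying (67) by $\{W/c - \alpha_x p_x - \alpha_y p_y - \alpha_z p_z - \beta\}$ we get

$$\left\{\frac{W^2}{c^2} - \sum_{x,y,z}\left[\alpha_x^2 p_x^2
  + (\alpha_x\alpha_y + \alpha_y\alpha_x)\,p_xp_y
  + (\alpha_x\beta + \beta\alpha_x)\,p_x\right] - \beta^2\right\}\psi = 0. \tag{68}$$

Comparing (68) with (66) we find

$$\alpha_x^2 = 1, \qquad \alpha_x\alpha_y + \alpha_y\alpha_x = 0, \qquad \alpha_x\beta + \beta\alpha_x = 0, \tag{69}$$

and similar relations for the other components of $\alpha$. One effect of the
comparison with (66) is that the new equation has to admit negative values of
the energy. Writing $\beta = \alpha_m mc$, we can obtain the following representation
of $\alpha$:

$$\alpha_x = \rho_1\sigma_x, \qquad \alpha_y = \rho_1\sigma_y, \qquad \alpha_z = \rho_1\sigma_z, \qquad \alpha_m = \rho_3, \tag{70}$$

$$\sigma_x = \begin{pmatrix} 0&1&0&0\\ 1&0&0&0\\ 0&0&0&1\\ 0&0&1&0 \end{pmatrix}, \qquad
\rho_1 = \begin{pmatrix} 0&0&1&0\\ 0&0&0&1\\ 1&0&0&0\\ 0&1&0&0 \end{pmatrix},$$

$$\sigma_y = \begin{pmatrix} 0&-i&0&0\\ i&0&0&0\\ 0&0&0&-i\\ 0&0&i&0 \end{pmatrix}, \qquad
\rho_2 = \begin{pmatrix} 0&0&-i&0\\ 0&0&0&-i\\ i&0&0&0\\ 0&i&0&0 \end{pmatrix},$$

$$\sigma_z = \begin{pmatrix} 1&0&0&0\\ 0&-1&0&0\\ 0&0&1&0\\ 0&0&0&-1 \end{pmatrix}, \qquad
\rho_3 = \begin{pmatrix} 1&0&0&0\\ 0&1&0&0\\ 0&0&-1&0\\ 0&0&0&-1 \end{pmatrix}.$$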
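
That matrices of this kind satisfy the conditions (69) can be verified by direct multiplication. The sketch below is a check added for illustration and is not the author's own procedure: it assumes the usual 4 x 4 forms of $\sigma_x$, $\sigma_y$, $\sigma_z$, $\rho_1$ and $\rho_3$ (those of Dirac's book), builds $\alpha_x$, $\alpha_y$, $\alpha_z$, $\alpha_m$ as in (70), and tests every pair. The dictionary names and helper functions are hypothetical.

```python
SIGMA = {
    "x": [[0, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]],
    "y": [[0, -1j, 0, 0], [1j, 0, 0, 0], [0, 0, 0, -1j], [0, 0, 1j, 0]],
    "z": [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, 1, 0], [0, 0, 0, -1]],
}
RHO_1 = [[0, 0, 1, 0], [0, 0, 0, 1], [1, 0, 0, 0], [0, 1, 0, 0]]
RHO_3 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def anticommutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    n = len(a)
    return [[ab[i][j] + ba[i][j] for j in range(n)] for i in range(n)]


def equals_multiple_of_unit(m, value):
    """True if m equals value times the unit matrix."""
    n = len(m)
    return all(abs(m[i][j] - (value if i == j else 0)) < 1e-12
               for i in range(n) for j in range(n))


# Representation (70): alpha_k = rho_1 sigma_k, alpha_m = rho_3.
ALPHA = {axis: matmul(RHO_1, SIGMA[axis]) for axis in "xyz"}
ALPHA["m"] = RHO_3

# Each alpha should square to 1 and anticommute with every other one.
CHECKS = {
    (p, q): equals_multiple_of_unit(
        anticommutator(ALPHA[p], ALPHA[q]), 2 if p == q else 0)
    for p in ALPHA for q in ALPHA
}

for pair, ok in sorted(CHECKS.items()):
    print(pair, ok)
```

Since $\beta = \alpha_m mc$ in this representation, the same table also covers the relations between the $\alpha$'s and $\beta$.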


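The wave equation (67) may now be written in the form

$$\left\{\frac{W}{c} + \rho_1\,(\sigma \cdot p) + \rho_3\,mc\right\}\psi = 0. \tag{71}$$

Hence following the classical procedure of replacing $W$ and $p$ by $W + e\phi$ and
$p + eA/c$ for the motion in an electromagnetic field, we write

$$\left\{\frac{W}{c} + \frac{e}{c}\,\phi + \rho_1\left(\sigma,\ p + \frac{e}{c}A\right) + \rho_3\,mc\right\}\psi = 0. \tag{72}$$

If the four-vector $A_\mu$ stands for $\phi$, $A_x$, $A_y$, $A_z$ and the four-vector
$p_\mu$ for $W/c$, $p_x$, $p_y$, $p_z$ we can write (72) in the symmetrical form*

$$\left\{\alpha_\mu\left(p_\mu + \frac{e}{c}A_\mu\right) + \alpha_m\,mc\right\}\psi = 0. \tag{73}$$

The Lorentz invariance of (73) can be proved by showing that if

$$p_\mu = a_{\mu\nu}\,p'_\nu, \qquad A_\mu = a_{\mu\nu}\,A'_\nu, \qquad \psi' = \gamma\psi, \tag{74}$$

$$\alpha_\mu a_{\mu\nu} = \gamma^{\times}\alpha_\nu\gamma, \qquad \alpha_m = \gamma^{\times}\alpha_m\gamma, \tag{75}$$

the equation (73) transforms into

$$\gamma^{\times}\left\{\alpha_\nu\left(p'_\nu + \frac{e}{c}A'_\nu\right) + \alpha_m\,mc\right\}\psi' = 0. \tag{76}$$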

When the equation (72) is written in the quadratic form, on multiplying it
by the factor

$$\frac{W}{c} + \frac{e}{c}\,\phi - \rho_1\left(\sigma,\ p + \frac{e}{c}A\right) - \rho_3\,mc, \tag{77}$$

we compare it with the classical quadratic equation obtained from (66) after
making the above mentioned alterations for an electromagnetic field; we find
that the new equation is equivalent to

$$\left\{\left(\frac{W}{c} + \frac{e}{c}\,\phi\right)^2 - \left(p + \frac{e}{c}A\right)^2 - m^2c^2
  - \frac{he}{2\pi c}\,(\sigma \cdot H') - i\rho_1\,\frac{he}{2\pi c}\,(\sigma \cdot E')\right\}\psi = 0 \tag{78}$$

if the form of the classical Hamiltonian is resorted to. The new terms are
the last two. In them $H'$, $E'$ are the magnetic and electric intensities.
Because calculation shows

$$H' = \operatorname{curl} A, \qquad E' = -\left(\frac{1}{c}\frac{\partial A}{\partial t} + \operatorname{grad}\phi\right). \tag{79}$$

* The dummy-suffix convention has been used in (73)-(76).

Hence the electron possesses the additional potential energy,

$$\frac{he}{4\pi mc}\,(\sigma \cdot H') + i\rho_1\,\frac{he}{4\pi mc}\,(\sigma \cdot E'),$$

unknown in the classical treatment. The electron has a magnetic moment
$-\dfrac{he}{4\pi mc}\,\sigma$ and an electric moment $-i\rho_1\dfrac{he}{4\pi mc}\,\sigma$ which is
imaginary. When we calculate the angular momentum by the formulae

$$M_x = yp_z - zp_y, \qquad M_y = zp_x - xp_z, \qquad M_z = xp_y - yp_x$$

the electron is found to have a spin angular momentum $h/4\pi$. "The same
vector $\sigma$ fixes the directions of both the spin magnetic moment and the spin
angular momentum. If an electron in a certain state of spin has a spin
angular momentum $\tfrac{1}{2}\hbar$ in a particular direction, it will have a magnetic
moment $-e\hbar/2mc$ in the same direction."

(Dirac "Q.M." (1935) p. 265)

Dirac's $\hbar$ is equal to $h/2\pi$ in our notation.

VII THE ATOM

1. The Problem of n Particles. 2. The Energy Levels in the Atom. 3. The 
Number of Elements. 

1. THE PROBLEM OF n PARTICLES 

Consider a system of $n$ identical particles; they may be electrons,
neutrons or atomic nuclei. It is not possible to distinguish one such particle
from another of the same group. This fact is often referred to as the identity
principle. Hence, if two such particles interchange their places the physical
system does not alter, which has to be taken account of in a study of the
physical states of two or more identical particles. In a theoretical presentation
one associates suffixes and coordinates with particles for the sake of
convenience and clarity. But they prove to be distinguishing marks and therefore
the description of a physical state has to be so given that the different
suffixes and coordinates describe only one situation even if the electrons
represented by them interchange places. Suppose¹ that $f$ is an observable of the
system. The eigenvalues of $f$ must be the same as those of $Pf$, if $P$ is any
permutation of electrons. The restriction on the possible observable $f$ of the
system is sufficient to obtain all information about it. Here the equation of
expectation values in a state $\psi$ is, in the usual notation,

$$\bar{f} = \int \psi^{\times} f\,\psi\,d\tau = \int \psi^{\times}(Pf)\,\psi\,d\tau. \tag{1}$$

If the operation $P^{-1}$ is performed on each of the functions in the second
integral the latter must still be unaltered because of the identity principle.
Therefore,

$$\int \psi^{\times} f\,\psi\,d\tau = \int (P^{-1}\psi)^{\times} f\,(P^{-1}\psi)\,d\tau \tag{2}$$

or

$$\int \psi^{\times} f\,\psi\,d\tau = \int \psi_0^{\times} f\,\psi_0\,d\tau, \tag{3}$$

where

$$P^{-1}\psi = \psi_0. \tag{4}$$

If (3) is to hold good for all observables it is just sufficient to take

$$f = \delta(x_1 - a_1)\,\delta(x_2 - a_2)\cdots$$

to show that

$$\psi^{\times}\psi = \psi_0^{\times}\psi_0 \tag{5}$$

at all points and

$$f = \delta(x_1 - a_1)\,\delta(x_2 - a_2)\cdots\delta'(x_n - a_n)$$

to get the other relation (6) at all points. From (5) and (6) it follows that

$$\psi = \psi_0\,e^{i\alpha} = P^{-1}\psi\,e^{i\alpha},$$

or

$$P\psi = \psi\,e^{i\alpha}, \tag{7}$$

$\alpha$ being a constant. Consider now any $\psi$ function described as
$\psi_{ijk\ldots}$ where each suffix stands for an electron. Let an interchange of
electrons $i$, $j$ take place so that

$$\psi_{jik\ldots} = \psi_{ijk\ldots}\,e^{i\alpha}.$$

If again an interchange of the same two electrons takes place we have

$$\psi_{ijk\ldots} = \psi_{jik\ldots}\,e^{i\alpha} = \psi_{ijk\ldots}\,e^{2i\alpha}.$$

Hence

$$e^{2i\alpha} = 1, \qquad e^{i\alpha} = \pm 1. \tag{8}$$

From the probability formula, $\psi^{\times}\psi$, it follows that the effect of a
permutation may be of the perfectly general form (7) as the phase does not
affect the probability density. However the only functions which are of physical
interest are given by (8). If

$$\psi_{ijk\ldots} = -\psi_{jik\ldots} \tag{9}$$

the system is said to be antisymmetrical while the wave function which obeys
the symmetry condition,

$$\psi_{ijk\ldots} = \psi_{jik\ldots} \tag{10}$$

gives the symmetrical system. The symmetry or antisymmetry of a system
is a physical quality and it is independent of the coordinate system used.
Dirac⁸ has shown from the laws of quantum mechanics that the symmetric
or antisymmetric nature of a system remains invariant for canonical
transformations. From the laws of motion he has also deduced that a system which
is at one time symmetrical (or antisymmetrical) remains always symmetrical
(or antisymmetrical). This particular property may be verified from the
familiar Schrödinger equation for electrons expressed in Cartesian coordinates.
The equation runs as follows:


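$$-\frac{h}{2\pi i}\,\frac{\partial\psi}{\partial t}
  = \left[-\frac{h^2}{8\pi^2 m}\sum_i \nabla_i^2 + V\right]\psi. \tag{11}$$

The potential energy is a symmetric function of the coordinates. Hence if $\psi$
is symmetrical or antisymmetrical the same property is shared by
$\partial\psi/\partial t$. This shows the invariance of the particular characteristic
enjoyed by $\psi$.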

Let us discuss at some length the field inside a many-electron atom.
The mathematical discussion for any assembly of $n$ electrons can be carried
out on precisely the same lines. If the nucleus has a charge $ze$ and the
number of electrons is $n$ the Hamiltonian is

$$H = \sum_{i=1}^{n}\left[\frac{p_i^2}{2m} - \frac{ze^2}{r_i} + \xi(r_i)\,L_i \cdot S_i\right]
  + \sum_{i>j=1}^{n}\frac{e^2}{r_{ij}}. \tag{12}$$

The motion of the nucleus is assumed to play a negligible part. The
distances of the electrons from the nucleus are given by $r_i$ and from each
other by $r_{ij}$. The spin-orbit interaction energy is assumed to be
$\xi(r_i)\,L_i \cdot S_i$. Thomas and Frenkel have shown that the spin-orbit
interaction energy for an electron in a central field with potential $V(r)$ is

$$\frac{1}{2m^2c^2}\left(\frac{1}{r}\frac{dV(r)}{dr}\right)L \cdot S = \xi(r)\,L \cdot S, \tag{13}$$

where $L$ is the orbital and $S$ the spin angular momentum. In writing (12)
we assume that this form of coupling gives a very good approximation to the
real interaction energy. When the $i$th electron is at a great distance from
the nucleus as compared to the other electrons, $r_{ij}$ is approximately the same
as $r_i$ for the $n - 1$ distances. Hence the effect of the mutual repulsion is to
produce a screening so that the nuclear potential, as toned down by the last
term of (12), becomes $-(z - n + 1)\,e^2/r_i$ for the $i$th electron. It is now
possible to split up (12) into two parts, $H = H_0 + H'$:

$$H_0 = \sum_i\left[\frac{p_i^2}{2m} + V(r_i)\right], \tag{14}$$

$$H' = \sum_i \xi(r_i)\,L_i \cdot S_i - \sum_i\left[\frac{ze^2}{r_i} + V(r_i)\right]
  + \sum_{i>j=1}^{n}\frac{e^2}{r_{ij}}. \tag{15}$$

The Schrödinger equation for (14), viz.,

$$H_0\,\psi = E_0\,\psi \tag{16}$$

possesses solutions of the form

$$\psi = \psi_1(\theta_1)\,\psi_2(\theta_2)\cdots\psi_n(\theta_n), \tag{17}$$

where $\theta_r$ stands for the four quantum numbers $n$, $l$, $m$, $s$ of the $r$th
electron. Since $\psi_1$, $\psi_2$ etc. all satisfy the same differential equation,
except for the eigenvalue constant which is different for different functions,
$\psi_1 = \psi_2$ when $\theta_1 = \theta_2$. Each electron moves in a central field as if
the other electrons are absent:

$$\left[-\frac{h^2}{8\pi^2 m}\nabla^2 + V(r_i)\right]\psi_i(\theta_i) = E_0(\theta_i)\,\psi_i(\theta_i); \tag{18}$$

$$E_0 = \sum_{i=1}^{n} E_0(\theta_i). \tag{19}$$

There is a two-fold degeneracy of $\psi$. Firstly $E_0(\theta_r)$ is independent of $m$
and $s$. Secondly, the total energy $E_0$ is the same for the $n!$ different
permutations of the electrons carried out through the suffixes. The most general
solution of (16) is therefore of the form

$$\psi = \sum_i C_i\,P_i\left[\psi_1(\theta_1)\,\psi_2(\theta_2)\cdots\psi_n(\theta_n)\right] \tag{20}$$

where each $P_i$ is a different substitution, being one of the $n!$ substitutions
on $n$ letters. The $C$'s are all constants. If the electrons constitute a
symmetrical system,

all $C$'s are equal. (21)

If they constitute an antisymmetrical system,

$$C_i = (-1)^i A, \tag{22}$$

where $A$ is a constant and $i$ is the number of transpositions into which $P_i$
can be resolved by any method. Let us take $\psi_1$, $\psi_2$, ... etc. as normalised
wave functions. For a system of $n$ particles we can have two possible $\psi$
functions.

$$\psi = \begin{Vmatrix}
\psi_1(\theta_1) & \psi_1(\theta_2) & \cdots & \psi_1(\theta_n) \\
\psi_2(\theta_1) & \cdots & & \\
\vdots & & & \\
\psi_n(\theta_1) & \psi_n(\theta_2) & \cdots & \psi_n(\theta_n)
\end{Vmatrix} \tag{23}$$

The square array is to be evaluated as a determinant for an antisymmetrical
system and as a permanent (which has all the terms of the determinant
written with the same sign) for a symmetrical one. The electrons in an atom
constitute an antisymmetrical system. For according to the Pauli exclusion
principle no two electrons in the atom are found to be with the same set of
quantum numbers, $\theta_i = \theta_j$; and when we put any two $\theta$'s as equal in the
determinant (23) it vanishes as two columns become identical. Hence the
non-existence of electrons with the same quantum numbers is explained on the
hypothesis that the electrons in an atom are represented by an antisymmetrical
$\psi$.
Quantum mechanics has nothing to say as to why the Pauli principle 
nolds good in the atom. We merely draw the conclusion from this observed 
fact that electrons satisfy the Fermi-Dirac statistics or that their statistical 
states are governed by an anti-symmetrical wave function. Similarly there 
is no quantumfoechanical explanation why the protons obey the Einstein- 
Bose statistics, that is, why the Schrodinger function of the orchestra of har- 
monic oscillators for radiation is symmetrical. The law of black-body 
radiation rperely tells us that Nature has favoured this statistics for radiation. 

An examination of the rotational bands of diatomic molecules reveals
what statistics the nuclei follow. The ratio of the intensities of alternate
bands gives the nuclear spin; which lines are weak and which are strong is
decided by the statistics that the nuclei obey. There is ample evidence that
the nitrogen nucleus obeys the Einstein-Bose statistics. This could not be
reconciled with the earlier view of the constitution of that nucleus, because
electrons and protons both follow the other statistics; and if there were 14
protons and seven electrons in the nitrogen nucleus it would claim an
antisymmetrical $\Phi$. For example consider two atomic nuclei each of which
consists of m electrons, p protons and n neutrons.³ Neutrons also satisfy the
Fermi-Dirac statistics. Hence one interchange of two electrons, of two protons
or of two neutrons changes the sign of $\Phi$ once. If the two nuclei are to
exchange places it means m + n + p exchanges for the constituent particles.
The sign of $\Phi$ would be altered into

$$(-1)^{m+n+p}$$

in this exchange of two nuclei. When m + n + p is even the Einstein-Bose
statistics is obeyed; when m + n + p is odd the other statistics is valid. If
the old view of the constitution of the nitrogen nucleus were correct, m = 7,
n = 0, p = 14 and the answer would be wrong. But according to the new
view, which has many other arguments in support, m = 0, n = 7, p = 7 and
the right answer is obtained.
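
The counting argument above can be checked mechanically. The following is a
minimal sketch, not part of the original lectures; the function names are
illustrative assumptions, and the only physics used is the sign rule
(-1)^(m+n+p) stated in the text.

```python
def exchange_sign(m, n, p):
    """Sign acquired by the wave function when two identical nuclei,
    each built of m electrons, n neutrons and p protons, change places."""
    return (-1) ** (m + n + p)


def statistics(m, n, p):
    """Name of the statistics implied by the exchange sign."""
    return "Einstein-Bose" if exchange_sign(m, n, p) == 1 else "Fermi-Dirac"


# Old view of the nitrogen nucleus: 14 protons and 7 electrons.
old_view = statistics(m=7, n=0, p=14)
# New view: 7 protons and 7 neutrons.
new_view = statistics(m=0, n=7, p=7)

print("old view:", old_view)
print("new view:", new_view)
```

Only the new view reproduces the Einstein-Bose behaviour found in the band
spectra.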

2. THE ENERGY LEVELS IN THE ATOM 

To a certain degree of approximation the electrons in an atom may be
assumed to be characterized by four quantum numbers in the manner explained
previously.² Each set of spatial quantum numbers gives a level which, by
the Pauli principle, is occupied by two electrons at the most. When there are
two they are paired, that is, their spins are ±1/2. In a heavy atom which is
placed near uranium in the periodic table, the various levels of energy in the
ascending order are :

1s ; 2s, 2p ; 3s, 3p, 3d ; 4s, 4p, 4d, 4f ; 5s, 5p, 5d ; 6s, . . . . (A)

On the other hand, the levels in a lighter atom with atomic number less
than 20, are arranged in the order :

1s ; 2s, 2p ; 3s, 3p ; 4s ; 3d ; 4p ; 5s ; 4d ; 5p ; 6s ; 5d ; 4f ; . . . . (B)

There is a gradual transition from the latter order to the first as we pass on
from calcium to the higher atoms. The atom should be looked upon as a
hollow in the usual energy representation, the various discrete levels of
negative energy ranging from the lowest 1s level to the top, beyond which there
is a continuous range of positive energy levels due to hyperbolic orbits. The
break in the order as in (B) is due to the interaction energy being too large.
The various elements are grouped in the periodic table according to the
character of the electrons in the outer shell. The inert gas column is
characterised by closed shells and s² p⁶ subshells. The hydrogen column contains
elements with only an s electron in the outer shell. It is interesting to see how
potassium is promoted to this column through the sudden break in order at 3p
as shown in (B). The alkaline earths are characterized by two s electrons in
the outermost shell. Since the periodic table is mainly concerned with the
grouping of elements according to their chemical properties it is but to be
expected that the outermost or valence electrons should be the chief index for
position in the table. It must be emphasized however that the physical
properties such as conductivity and refractive index vary from element to
element in the same column as they are not dependent exclusively on the
outer electrons. The central principle on which the different levels of the
normal atom (in an unexcited or ground state) are occupied is what is called
the building-up principle. According to it the electrons occupy the lowest
possible levels without violating the exclusion principle.
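
The building-up principle lends itself to a short illustration. The sketch
below is an addition under stated assumptions: it fills the levels strictly in
the order (B) quoted above, which the text gives only for the lighter atoms,
and uses the capacity 2(2l + 1) of each subshell. The names `ORDER_B`,
`build_up` and `show` are hypothetical helpers, not notation from the lectures.

```python
# Order (B) of the text; meaningful only for the lighter atoms.
ORDER_B = ["1s", "2s", "2p", "3s", "3p", "4s", "3d", "4p",
           "5s", "4d", "5p", "6s", "5d", "4f"]

# Two electrons (paired spins) for each of the 2l + 1 spatial levels.
CAPACITY = {"s": 2, "p": 6, "d": 10, "f": 14}


def build_up(z):
    """Distribute z electrons over the levels of ORDER_B, lowest first."""
    config = []
    remaining = z
    for level in ORDER_B:
        if remaining == 0:
            break
        taken = min(CAPACITY[level[-1]], remaining)
        config.append((level, taken))
        remaining -= taken
    if remaining:
        raise ValueError("more electrons than the listed levels can hold")
    return config


def show(z):
    """Render the configuration as a string such as '1s2 2s2 2p6'."""
    return " ".join(f"{level}{count}" for level, count in build_up(z))


print(show(18))  # argon: closed s and p subshells
print(show(19))  # potassium: the next electron enters 4s, not 3d
print(show(20))  # calcium: two s electrons in the outermost shell
```

For potassium the sketch places the last electron in 4s, which is the break in
order at 3p that promotes it to the hydrogen column.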

It is by means of spectra that the theory of the order of levels is verified.
With each atom is associated a term type of the ground state as also of the
other states. From the spectral character these types can be inferred. On
calculating the term types theoretically it is ascertained that the grouping
of electrons in the shells as given is correct. How the inner 3d shell is
gradually filled up in the case of the eight elements Sc to Ni, the inner 4d
shell for the group of eight, Y to Pd, the inner 4f shell for the rare earths and
the 5d shell for the atomic numbers (71-78): all these facts are suggested
by the term types and the variation of the ionization potential from element
to element. Some preliminary explanations will have to be given before the
calculation of the term type is made clear.

The angular momentum for the spin of the electron is always h/4π, the
magnitude of the spin itself being 1/2. We will take h/2π as the unit of angular
momentum for the sake of convenience. When there are n electrons in the
atom their spin vectors $\mathbf{s}_i$ combine to give the resultant spin vector
$\mathbf{S}$ whose allowed values are

$$\frac{n}{2},\ \frac{n}{2} - 1,\ \frac{n}{2} - 2,\ \ldots$$

The series terminates with 0 if n is even and with 1/2 if n is odd. When all
the spin vectors are parallel we get n/2. When the allowed value is S, the
exact amount of the angular momentum (scalar) is $\sqrt{S(S+1)}$ according to
the rigorous treatment of quantum mechanics. Obviously S is integral (or
half-integral) if the number of electrons is even (or odd). The multiplicity
of an atomic term is defined as 2S + 1. The term is even (or odd) as the
number of electrons is odd (or even). We now understand why the atoms
arranged in the order of their atomic number have alternately even and odd
term types. The reason why 2S + 1 is called the multiplicity may also be
made clear now.
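
The series of allowed resultant spins and the alternation of even and odd
multiplicities can be tabulated directly. This is an illustrative sketch, not
the author's procedure; `total_spin_values` and `multiplicity` are assumed
helper names, and spins are kept as exact fractions in units of h/2π.

```python
from fractions import Fraction


def total_spin_values(n):
    """Allowed resultant spins for n electrons: n/2, n/2 - 1, ... down to 0 or 1/2."""
    values = []
    s = Fraction(n, 2)
    while s >= 0:
        values.append(s)
        s -= 1
    return values


def multiplicity(s):
    """Multiplicity 2S + 1 of a term with resultant spin S."""
    return int(2 * s + 1)


for n in range(1, 5):
    spins = total_spin_values(n)
    print(n, [str(s) for s in spins], [multiplicity(s) for s in spins])
```

An odd number of electrons gives only even multiplicities (doublets,
quartets), an even number only odd ones (singlets, triplets).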

Let the resultant orbital angular momentum vector for the electrons be
$\mathbf{L}$, the maximum value being given by the integer L. Then the law of
interaction between $\mathbf{L}$ and $\mathbf{S}$ permits, for L > S, the
following 2S + 1 values for the resultant :

L - S, L - S + 1, . . . . , L + S.

But if S > L, the following 2L + 1 values are permitted :

S - L, S - L + 1, . . . . , S + L.

There are therefore (2S + 1) different values of energy in the more frequent
case L > S. Hence (2S + 1) is defined as the multiplicity of the term. When
we speak of the ³S term for which L = 0 and S = 1, there being two electrons,
we describe the multiplicity as 3 just to stick to this convention although the
number of distinct energy levels is 2L + 1 = 1 in this case. The resultant
of the L, S vectors is denoted by J.
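
The two cases L > S and S > L are covered by a single rule: J runs in unit
steps from |L - S| to L + S. A small sketch of that rule follows; the function
name `j_values` is an assumption made for illustration.

```python
from fractions import Fraction


def j_values(L, S):
    """Resultant values J = |L - S|, |L - S| + 1, ..., L + S."""
    L, S = Fraction(L), Fraction(S)
    values = []
    j = abs(L - S)
    while j <= L + S:
        values.append(j)
        j += 1
    return values


print([str(j) for j in j_values(1, 1)])               # a 3P term: three levels
print([str(j) for j in j_values(0, 1)])               # a 3S term: one level only
print([str(j) for j in j_values(2, Fraction(1, 2))])  # a 2D term: two levels
```

For the ³S term the list has a single entry, in agreement with the remark that
the multiplicity 3 is then only a convention.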

To determine the vector $\mathbf{L}$ it is not necessary to take account of all
the electrons in the atom. Those in closed shells and subshells may be dropped.
For Σm = 0 for all the levels of a subshell, mh/2π being the component of the
orbital angular momentum in an arbitrary direction. For the calculation
of $\mathbf{L}$ and obviously for that of $\mathbf{S}$ as well it is sufficient
to consider the electrons of the outermost open subshell or subshells. In the
case of two electrons with orbital angular momenta $l_1$ and $l_2$ the possible
values of the resultant are

$$l_1 + l_2,\ l_1 + l_2 - 1,\ \ldots,\ |l_1 - l_2|.$$

Successive additions of vectors are to be carried out by this rule.

In the case of a small interaction between the electrons it is represented
by a small velocity of precession of the vectors $\mathbf{l}_1, \mathbf{l}_2,
\ldots$ about the resultant $\mathbf{L}$ and the orbital velocities have a
physical meaning. But when the velocity of precession is large the interaction
is strong and the orbital velocities cease to have any meaning. The mathematical
procedure, however, allows the use of the individual quantum numbers even in the
case of strong interaction. This is because the content of Ehrenfest's adiabatic
principle is implicit in the assumptions made. When there is a strong coupling
between L and S the vectors appear to possess a large velocity of precession
about the resultant vector J. When L = 0 the term is S ; for L = 1 it is a P
term ; L = 2 gives a D term and so on. Thus when we say that the ground state of
oxygen is ³P₂ we mean that L = 1, S = 1. That accounts for the symbol P and
the prefix 3. The total angular momentum J is given by the suffix 2. There
are 3 different J values of which the one that corresponds to the lowest energy
must be specified and that is done in this case by 2. A term is said to be
normal or regular when the energy values of the term increase with J. But
if they decrease the term is called inverted. When the number of outer
electrons is more than half of what the subshell can contain, as in the case
of chlorine, the inversion takes place. The result is dependent upon detailed
calculations of the spin-orbit interaction matrix. The actual derivation of
the term symbol which depends upon the value of L is not a simple matter.
Hund's rule is found to be successful here. It says, "Of the terms given by
equivalent electrons, those with greatest multiplicity lie deepest, and of those
the lowest is that with the greatest L" (Herzberg, "Atomic spectra and
structure" (1927) p. 135).
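
Hund's rule, together with the statement about regular and inverted terms, is
enough to write down the ground term of a single open subshell of equivalent
electrons. The sketch below is an illustration under assumptions: the helper
`ground_term` and the letter table are not from the text, and the choice
J = |L - S| for a subshell that is not more than half full is the usual
convention for regular terms, assumed here rather than quoted from the lectures.

```python
from fractions import Fraction

LETTERS = "SPDFGHIK"  # term letters for L = 0, 1, 2, ...


def ground_term(l, n):
    """Ground term of n equivalent electrons in a subshell of orbital number l.

    Spins are made parallel as far as the exclusion principle allows
    (greatest multiplicity), and the highest m values are occupied first
    (greatest L). J is the lowest level: L + S if the subshell is more
    than half full (inverted term), |L - S| otherwise.
    """
    orbitals = 2 * l + 1
    if not 0 < n < 2 * orbitals:
        raise ValueError("the subshell must be partly filled")
    m_values = list(range(l, -l - 1, -1))
    n_up = min(n, orbitals)      # electrons with spin +1/2
    n_down = n - n_up            # electrons with spin -1/2
    S = Fraction(n_up - n_down, 2)
    L = abs(sum(m_values[:n_up]) + sum(m_values[:n_down]))
    J = L + S if n > orbitals else abs(L - S)
    return f"{int(2 * S + 1)}{LETTERS[L]}{J}"


print(ground_term(1, 4))  # oxygen, four 2p electrons
print(ground_term(1, 5))  # chlorine, five 3p electrons
```

For oxygen this gives the symbol 3P2, i.e. the ³P₂ ground state quoted above,
with the suffix 2 arising from the inversion.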

The anomalous Zeeman effect can be calculated quite accurately without
going into the intricacies of the matrix mechanics. The resultant magnetic
moment is to be calculated for the atom on the assumption (proved by
Dirac's relativistic treatment of the electron) that the ratio of the magnetic
moment to the angular momentum is, for the spin, twice that for the
orbital motion. The Paschen-Back effect in strong magnetic fields is
explained by the fact that the L-S (Russell-Saunders) coupling which we have
tacitly assumed above ceases to give a good approximation to the interaction
energy and that the j-j coupling scheme has to be assumed, j being the total
angular momentum for an electron.
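
The calculation of the resultant magnetic moment mentioned above leads, in the
vector model, to the well-known Landé splitting factor. The text does not quote
this formula, so the sketch below should be read as an added illustration; the
function name `lande_g` is an assumption, and the factor 2 for the spin is the
only physical input.

```python
def lande_g(L, S, J):
    """Landé factor for L-S coupling, with the spin counted twice:
    g = 1 + [J(J+1) + S(S+1) - L(L+1)] / [2 J(J+1)]."""
    if J == 0:
        raise ValueError("g is undefined for J = 0 (no magnetic splitting)")
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))


print(lande_g(0, 0.5, 0.5))  # pure spin: the factor 2
print(lande_g(1, 0, 1))      # pure orbital motion: the normal value 1
print(lande_g(1, 0.5, 1.5))  # a 2P term with J = 3/2
```

A singlet term gives g = 1 and hence the normal Zeeman effect; any term with
S different from zero gives the anomalous splitting.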

The existence of two kinds of helium is now very easily understood.
Parhelium exhibits transitions between singlet states, S = 0, and orthohelium
between the triplet states, S = 1. When the electron spins in helium are paired
we get S = 0 and the spatial quantum numbers satisfy the condition of symmetry.
But if the two electrons are parallel or 'bachelors', S = 1 and there is an
antisymmetry of the ψ-function for the spatial co-ordinates. To the first
degree of approximation the selection rule ΔS = 0 holds good for the
emission lines. But it breaks down for the higher atoms. The appearance of the
so-called forbidden lines does not imply that the theory is wrong ; on the
contrary it supports the theory inasmuch as weak lines which are not
permitted by the selection rules, derived from first order considerations, are
given by the higher approximations. The concept of a metastable state is
dependent upon this notion of forbidden lines. In the case of helium 2³S is a
metastable state, lying above the 1¹S level, because the transition from the
former to the latter is impossible as first order considerations show.

We have already referred to the possibility of forbidden lines⁴ arising
from the quadrupole moment. The integrals giving the transition probabilities
do not identically vanish for the quadrupole moment if they do for the
dipole moment. Secondly, in a stationary state when L and S differ from
zero the total magnetic moment vector does not lie in the same direction as
the angular momentum vector on account of the notorious factor 2 in
connection with the spin. The magnetic moment therefore precesses about J,
being ultimately responsible for forbidden lines given by the magnetic dipole
radiation.

3. THE NUMBER OF ELEMENTS

The ancient Hindus and the Greeks believed that the material universe
is built up of five elements. They were not far from the modern view if
they meant by the elements the three states of matter, solid, liquid and
gaseous, and ether and radiation. It was too broad a hypothesis to give birth
to a new science. What is meant by elements in physics or chemistry to-day
is quite different from what the ancients meant by the five elements.

All substances are complex or simple. A complex substance can be 
shown to be made up of other substances with different chemical properties ; 
a simple substance cannot be further analysed into chemically different sub- 
stances and is called an element. Table salt, water and sugar are illustrations 
of complex substances while carbon, copper and sulphur are simple substances
or elements. From prehistoric times are known the elements carbon, sulphur,
iron, copper, silver, tin, gold and lead. Until recently only ninety different
elements were known. Ekaiodine and ekacaesium, which are very similar to
iodine and caesium in their chemical properties, have now also been discovered.
Their numbers in the periodic table are 85 and 87 respectively.

What is the maximum possible number of chemical elements? The 
adjective ‘chemical’ is used because it is the chemical properties that dis- 
tinguish one element from another. There may be any number of distinct 
isotopes of an element, all of them being chemically similar but varying in 
their physical properties. Thus there are three isotopes of hydrogen already 
known ; their weights are approximately in the proportion 1:2:3. Jeans, in 
his theory of the internal constitution of stars, had come to the conclusion 
that stellar stability depends upon the existence of elements of atomic num- 
ber nearly 95. It was advanced that the elements from hydrogen to uranium, 
the first ninety-two, abound in the surface while deep down in the interior 
of a star only heavier elements exist. Although no element heavier than 
uranium had been discovered the fact that the energy expression of the 
relativistic Bohr model becomes infinite for the atomic number Z, nearly 
equal to 137, was interpreted as meaning that the maximum number of 
chemical elements is 137. Quite apart from the question of the possible
existence of transuranic elements, Jeans's theory of the stability of stars is
no longer valid.

Discussing the question in 1928 Flint and Richardson obtained a lower 
limit on the highest atomic number. Their limit was 98. Even in this case 
the limit was found as a restriction put by relativity on the existence of a 
minimum radius for a circular orbit in the Bohr atom.

The author⁷ took up the question of the highest atomic number in 1932
while working under Eddington on the latter's relativistic theory of the
interaction of two electrons. It was discovered that out of a possible 136
rotations only 91 left the interaction term unchanged. This was supposed to mean
that the highest atomic number is 92 and that transuranic elements do not
exist.

In 1934 however came the discovery in Fermi's laboratory of what looked
like transuranic elements, numbers 93 and 94. Uranium was bombarded with
neutrons. The nuclei became unstable with the capture of neutrons. But
they appeared to become stable with the emission of electrons. This process
suggested the existence of transuranic elements. Unstable uranium gave rise
to four different types of emanations with half-periods 10 sec., 40 sec., 13 min.
and 90 min. As there are only three isotopes of uranium one of these periods
was supposed to be due to the formation of element 93. If it existed it was
a homologue of Mn and therefore if any manganese salt was added to the
uranium mixture the element 93 was expected to go down with the
precipitate MnO₂. Such a precipitate was actually discovered, diminishing by
about 15 per cent the 13-minute and 90-minute activities. The precipitate
showed no trace of U or UX₁ or UX₂. These facts were interpreted as
evidence of the existence of element 93. It was argued by Noddack that
the chemical evidence did not necessarily suggest Fermi's conclusion but the
alternative hypothesis of uranium breaking up into radioactive isotopes of
smaller atoms was not generally accepted. Series of chemical tests were
devised in later years by Hahn and Meitner, by Curie and Savitch and by
Hahn and Strassmann for the examination of active products of bombarded
uranium. The experiments were responsible for the sensational conclusion
that Noddack was right and that what appeared like transuranic elements are
actually radioactive isotopes of lighter atoms obtained by fission. The fission⁵
of a uranium nucleus may be described as follows :


$${}^{239}_{\ 92}\mathrm{U} \rightarrow {}^{140}_{\ 54}\mathrm{Xe} + {}^{99}_{38}\mathrm{Sr}.$$

The ${}^{140}_{\ 54}$Xe nucleus undergoes four successive beta-disintegrations
producing in turn Cs → Ba → La → Ce. Similarly, the other nucleus undergoes six
beta-disintegrations according to the chain

Sr → Y → Zr → Cb → Mo → Ma → Ru.

The energy that evolves in the course of the fission and disintegrations is
obtained by Einstein's relation,

$$\delta E = c^2 (m_0 - m_1 - m_2)$$

where $m_0$ is the mass of the original U atom and $m_1$ and $m_2$ are the
masses of the final products Ce and Ru, c being the velocity of light. Similar
fission has been observed for Th and Pa.
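
The bookkeeping of the fission and of the two beta chains can be checked in a
few lines. This is an illustrative sketch only: the helper names are
assumptions, the conversion factor of about 931.494 MeV per atomic mass unit is
a standard value not given in the text, and no measured masses are supplied, so
the energy function is shown without real data.

```python
def charge_after_beta_decays(z, k):
    """Each beta-disintegration emits one electron and raises Z by one."""
    return z + k


# Mass numbers and charges balance in the fission itself.
assert 140 + 99 == 239
assert 54 + 38 == 92

z_end_xe_chain = charge_after_beta_decays(54, 4)  # Xe -> Cs -> Ba -> La -> Ce
z_end_sr_chain = charge_after_beta_decays(38, 6)  # Sr -> ... -> Ru

MEV_PER_MASS_UNIT = 931.494  # assumed standard conversion, MeV per mass unit


def energy_release_mev(m0, m1, m2):
    """Einstein's relation, with the three masses in atomic mass units."""
    return MEV_PER_MASS_UNIT * (m0 - m1 - m2)


print(z_end_xe_chain, z_end_sr_chain)
```

The end charges 58 and 44 are those of Ce and Ru, as the chains require.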

The phenomenon of nuclear fission has given rise to various questions. 
How is it that a U-nucleus which is stable gets broken up so easily by a 
neutron? Why does the nucleus break unsymmetrically? Why is it that
no nucleus of mass greater than 238 is found in nature? Moreover, as it
is found theoretically that energy can be gained by splitting up nuclei of
weight greater than 110 it is a mystery how heavier nuclei exist in a very
stable form.

The theoretical questions have been discussed by Bohr, Wheeler and 
others and Bohr's view is becoming popular that the nucleus is something
like a spherical drop of fluid, its energy being partly in the surface and partly
in the volume. Two kinds of forces are supposed to be at work : the forces
of attraction which are of the van der Waals type and are rooted in the
surface ; the forces of repulsion whose seat of action is spread over the
volume. The nucleus is supposed to be stable so long as the spherical shape
is one of minimum total energy. As the sphere increases in size there comes
a stage when the sphere ceases to be the shape for minimum energy. It is
believed that this stage is reached in U. A more quantitative theory will not
only answer the question mooted above but the problem of the highest atomic 
number will definitely be solved thereby. All recent work shows that 
transuranic elements most probably do not exist. 

But we can no longer say that the ninety-two elements, chemically dis- 
tinct as they are, are physically independent. Transmutation of elements 
was begun by Rutherford with α-particles and he got oxygen out of nitrogen
in his first successful attempt. The discovery of neutrons by Chadwick and 
the construction of the cyclotron by Lawrence have considerably accelerated 
the progress in this direction. The latest is the fission of the very heavy 
nuclei described above. 

In the beginning of the section substances were described as simple or 
complex. The nuclear particles such as electrons, neutrons and the rest are 
neither simple nor complex substances. They possess rest masses and must 
therefore be distinguished from electromagnetic radiations. It is safe to look 
upon these particles as substances of nuclear chemistry. 

Until very recently it was generally believed that the transmutation of
hydrogen into heavier elements takes place in the stars. The latest⁸ studies of
the sources of stellar energy show that hydrogen is being steadily transformed
into helium mainly through the agency of autocatalization in most of the
stars and the building up of heavy elements out of hydrogen is no longer
supposed to be possible at the temperature-pressure conditions obtaining in
the interior of stars.

VIII RADIATION


1. The Classical Theory of Radiation. 2. The Quantum Theory of Radiation. 
3. Phenomenological Successes. 4. Peculiar Features and Difficulties. 5. The 
Positive Electron. 


1. THE CLASSICAL THEORY OF RADIATION 

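The classical theory¹ of radiation is based on the following field
equations :

$$\operatorname{curl}\mathbf{E} + \frac{1}{c}\dot{\mathbf{H}} = 0, \qquad (1a)$$

$$\operatorname{div}\mathbf{H} = 0, \qquad (1b)$$

$$\operatorname{curl}\mathbf{H} - \frac{1}{c}\dot{\mathbf{E}} = \frac{4\pi}{c}\rho\mathbf{v}, \qquad (1c)$$

$$\operatorname{div}\mathbf{E} = 4\pi\rho. \qquad (1d)$$
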

E and H are the electric and magnetic force vectors, ρ is the charge density
while v is the velocity vector ; ρv is therefore the current density. As
usual c stands for the velocity of light in ether. From (1c) and (1d) follows
the equation of continuity,

$$\operatorname{div}(\rho\mathbf{v}) + \dot{\rho} = 0, \qquad (2)$$

which means that the charge is conserved. The force per unit charge in the
electromagnetic field is given by

$$\mathbf{F} = \mathbf{E} + \frac{1}{c}\,\mathbf{v}\times\mathbf{H} \qquad (3)$$

where v is the velocity of the charge.

The four equations (1) express the quintessence of the classical
electromagnetic theory. Faraday's law of electromagnetic induction is given by
(1a). The non-existence of discrete magnetic poles of one kind is assumed
by (1b). Ampère's law for the magnetic field due to electric currents and
Maxwell's displacement current hypothesis are embodied in (1c) while (1d)
stands for the fact that charges are subject to Coulomb's law.
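
The step from (1c) and (1d) to the equation of continuity (2) is short enough
to write out. The derivation below is an added worked step in the notation of
equations (1), using only the identity div curl = 0; it is a sketch of the
standard argument rather than a quotation from the lectures.

```latex
% Take the divergence of (1c); the divergence of a curl vanishes identically.
\operatorname{div}\operatorname{curl}\mathbf{H}
  - \frac{1}{c}\,\frac{\partial}{\partial t}\operatorname{div}\mathbf{E}
  = \frac{4\pi}{c}\operatorname{div}(\rho\mathbf{v}),
\qquad
\operatorname{div}\operatorname{curl}\mathbf{H} = 0 .

% Substitute div E = 4 pi rho from (1d).
-\frac{4\pi}{c}\,\dot{\rho} = \frac{4\pi}{c}\operatorname{div}(\rho\mathbf{v})
\quad\Longrightarrow\quad
\operatorname{div}(\rho\mathbf{v}) + \dot{\rho} = 0 .
```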
For mathematical convenience the field equations are usually expressed
in the following alternative form :

$$\mathbf{H} = \operatorname{curl}\mathbf{A}, \qquad (4a)$$

$$\frac{1}{c^2}\ddot{\mathbf{A}} - \nabla^2\mathbf{A} = \frac{4\pi}{c}\rho\mathbf{v}, \qquad (4b)$$

$$\frac{1}{c^2}\ddot{\phi} - \nabla^2\phi = 4\pi\rho, \qquad (4c)$$

$$\mathbf{E} + \frac{1}{c}\dot{\mathbf{A}} = -\operatorname{grad}\phi, \qquad (4d)$$

$$\operatorname{div}\mathbf{A} + \frac{1}{c}\dot{\phi} = 0. \qquad (4e)$$

Although there is a lack of precision about the definitions of A and φ we
introduce the two potentials since it is more convenient to integrate (4b)
and (4c).

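From the standpoint of the quantum mechanical applications the most
important case is that of a moving point-charge e at P for which the
retarded potentials,

$$\phi(P, t) = \left.\frac{e}{r + (\mathbf{v}\cdot\mathbf{r})/c}\right|_{t - r/c}, \qquad (5)$$

$$\mathbf{A}(P, t) = \left.\frac{e}{c}\,\frac{\mathbf{v}}{r + (\mathbf{v}\cdot\mathbf{r})/c}\right|_{t - r/c}, \qquad (6)$$

were first given by Liénard and Wiechert, the right-hand sides being defined
for the earlier epoch t - r/c. From (4a) and (4d) we get

$$\mathbf{H} = \frac{1}{r}\,\mathbf{E}\times\mathbf{r}, \qquad (7a)$$

$$\mathbf{E} = -\frac{e\beta^2}{s^3}\left(\mathbf{r} + \frac{r\mathbf{v}}{c}\right)
  + \frac{e}{c^2 s^3}\,\mathbf{r}\times\left[\left(\mathbf{r} + \frac{r\mathbf{v}}{c}\right)\times\dot{\mathbf{v}}\right], \qquad (7b)$$

where $\beta^2 = 1 - v^2/c^2$, $s = r + (\mathbf{v}\cdot\mathbf{r})/c$.

It is obvious from (7a) that the magnetic field is always perpendicular to r
and E. The first part of E gives the electrostatic effect which is superseded
at great distances from the origin by the second part representing the wave
zone. This distinction is valid only if v/c ≪ 1.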

In the case of a harmonic oscillator consisting of a charge e performing
small simple harmonic motion the displacement at time t may be taken as

$$x\,\mathbf{i} = x_0 \cos\nu t\;\mathbf{i}, \qquad (8)$$

i being a unit vector. If j is the unit vector in the direction of r or PO
where P is an external point and O the centre of oscillations,

$$\mathbf{H} = \frac{e}{c^2 r}\,\nu^2 x_0 \cos\nu t\,\sin\theta\;\frac{\mathbf{i}\times\mathbf{j}}{\sin\theta}, \qquad (9a)$$

$$\mathbf{E} = \frac{e}{c^2 r}\,\nu^2 x_0 \cos\nu t\,\sin\theta\;\frac{\mathbf{i} - \mathbf{j}\cos\theta}{\sin\theta}, \qquad (9b)$$

θ being the angle between i and j.

$$\mathbf{E}\times\mathbf{H} = \frac{e^2}{c^4 r^2}\,\nu^4 x_0^2 \cos^2\nu t\,\sin^2\theta\;(-\mathbf{j}); \qquad (10)$$

-j is the direction of the flow of radiation. It is clear that -j, E, H
constitute a right-handed system of mutually perpendicular directions. A
plane electromagnetic wave can therefore be best described by the equations

$$E_y = a \sin 2\pi\nu\,(x/c - t) = H_z \qquad (11)$$

as $E_x = E_z = H_y = H_x = 0$. The plane of polarization, being perpendicular
to E, is xOz.

The energy U and momentum G/c of an electromagnetic field are
given by

$$U = \frac{1}{8\pi}\int (E^2 + H^2)\,d\tau, \qquad (12)$$

$$\mathbf{G} = \frac{1}{4\pi}\int (\mathbf{E}\times\mathbf{H})\,d\tau, \qquad (13)$$

dτ being the element of volume. In an exclusive field of radiation in
which charges do not exist, |G| = U. This result, which follows from
Gauss's formula and which makes the energy-momentum vector invariant for
the Lorentz transformation, is akin to that for a photon, viz., E = pc, p being
the momentum scalar and E the energy.

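In the case of the oscillator the energy flow per unit time through an
area $r^2\,d\omega$ at P becomes

$$\frac{c}{4\pi}\,|\mathbf{E}\times\mathbf{H}|\,r^2\,d\omega
  = \frac{e^2}{4\pi c^3}\,\nu^4 x_0^2 \cos^2\nu t\,\sin^2\theta\,d\omega. \qquad (14)$$

The total energy flow through the spherical surface of radius r per unit time is

$$\frac{e^2}{4\pi c^3}\,\nu^4 x_0^2 \cos^2\nu t \int \sin^2\theta\,d\omega
  = \frac{2}{3c^3}\,e^2 \nu^4 x_0^2 \cos^2\nu t. \qquad (15)$$

Since

$$\frac{\nu}{2\pi}\int_0^{2\pi/\nu} \cos^2\nu t\,dt = \frac{1}{2},$$

the time-average value of the energy flow per unit time is

$$\frac{e^2\nu^4 x_0^2}{3c^3}. \qquad (16)$$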
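
The numerical factors in (15) and (16) come from two elementary integrals, and
they can be confirmed without any algebra. The sketch below is an added check
using a plain midpoint rule; the function names and step counts are arbitrary
assumptions.

```python
import math


def angular_integral(steps=2000):
    """Midpoint-rule value of the integral of sin^2(theta) over the unit
    sphere, with d(omega) = 2 pi sin(theta) d(theta)."""
    d_theta = math.pi / steps
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * d_theta
        total += math.sin(theta) ** 2 * 2 * math.pi * math.sin(theta) * d_theta
    return total


def mean_cos_squared(steps=2000):
    """Average of cos^2 over one complete period."""
    samples = (math.cos(2 * math.pi * (k + 0.5) / steps) ** 2 for k in range(steps))
    return sum(samples) / steps


factor_15 = angular_integral() / (4 * math.pi)   # coefficient in (15)
factor_16 = factor_15 * mean_cos_squared()       # coefficient in (16)

print(factor_15, factor_16)
```

The two printed numbers are the coefficients 2/3 and 1/3 of
$e^2\nu^4x_0^2/c^3$ in (15) and (16).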
The problems of scattering, emission and absorption are treated by
applying classical mechanics to harmonic oscillators and to their interaction
with the periodic force of radiation. In the treatment of these and other
problems the following defects of the classical theory of radiation are
encountered :

(1) The existence of a point charge implies an infinite electrostatic
energy. If we take the electron as a sphere of radius $r_0$ of the order of
$e^2/mc^2$ it has a self-energy and a self-momentum due to its motion in the
field as given by the field laws. If the total energy and momentum of the
particle and the surrounding field are considered the laws of conservation are
fulfilled but the four quantities do not provide a vector to which the Lorentz
transformation can be applied. The difficulties of the radius $r_0$ and of the
origin of the electronic mass are interconnected and have not been solved as
yet. The physical principles of the old theory as well as the new fail here.
The question as to whether mass has electromagnetic origin or not must be
examined in the light of the latest results of nuclear physics. Eddington⁵ has
derived the equation

$$10x^2 - 136x + 1 = 0 \qquad (17)$$

from certain fundamental considerations and he concludes that the ratio of
the roots, which is nearly 1848, must be exactly equal to the ratio of the
masses of the proton and the electron.
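
The ratio of the roots of (17) is easily evaluated. The following sketch is an
added numerical check, assuming the quadratic reads 10x² - 136x + 1 = 0; the
helper name `quadratic_roots` is illustrative.

```python
import math


def quadratic_roots(a, b, c):
    """Real roots of a x^2 + b x + c = 0, larger root first."""
    d = math.sqrt(b * b - 4 * a * c)
    return (-b + d) / (2 * a), (-b - d) / (2 * a)


large, small = quadratic_roots(10, -136, 1)
ratio = large / small

print(large, small, ratio)
```

The ratio comes out at about 1847.6.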

(2) The interaction term for an electron and a light wave contains
powers of $r_0/\lambda$. As the structure of the electron is neglected in the
theory we cannot legitimately discuss an interaction, a large part of which
comes from the structure term $r_0/\lambda$. Hence we cannot successfully apply
the classical theory of radiation if $r_0/\lambda$ is not small.

These are the intrinsic difficulties of the theory. The phenomenological 
difficulties have been already mentioned in different contexts. The quan- 
tum theory which we describe in the next section shows how light behaves as 
waves in some situations and as particles in others. Between the dualism of 
light and that of matter there is one essential difference. Quantization takes us 
from particles to waves (of probability) in the case of matter and from 
(electromagnetic) waves to particles in the case of light. If h = 0 matter is
corpuscular and light is waves.

2. THE QUANTUM THEORY OF RADIATION 

It is possible to cast the radiation problem into the Hamiltonian form
and then apply the rules of quantization.² Let us first take the simpler case
of the pure radiation field for which φ = 0 and A satisfies just the two
conditions,

$$\nabla^2\mathbf{A} - \frac{1}{c^2}\ddot{\mathbf{A}} = 0, \qquad (18a)$$

$$\operatorname{div}\mathbf{A} = 0. \qquad (18b)$$


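Consider the radiation enclosed in a cube of side L, A being periodic on the
surface. Let L be the unit of length. A may be chosen so that

$$\mathbf{A} = \sum_\lambda q_\lambda(t)\,\mathbf{A}_\lambda(x, y, z). \qquad (19)$$

Then if

$$\ddot{q}_\lambda + \nu_\lambda^2\,q_\lambda = 0, \qquad (20)$$

we have

$$\nabla^2\mathbf{A}_\lambda + \frac{\nu_\lambda^2}{c^2}\,\mathbf{A}_\lambda = 0. \qquad (21)$$

The equation (20) is equivalent to that for a harmonic oscillator for which the
Hamiltonian is

$$H_\lambda = \tfrac{1}{2}\,(p_\lambda^2 + \nu_\lambda^2 q_\lambda^2), \qquad (22)$$

$p_\lambda$ being equal to $\dot{q}_\lambda$. The total energy of the field can
be shown to be $\sum_\lambda H_\lambda$.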
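So far no quantum condition has been used. We have to take the
quantum conditions

$$p_\lambda q_\lambda - q_\lambda p_\lambda = -\frac{ih}{2\pi}, \qquad
  p_\lambda q_\mu - q_\mu p_\lambda = 0, \ \text{etc.} \qquad (23)$$

In practice there is a change in the procedure because we have to start with a
more general expansion of A than (19) in order to make the results widely
applicable. We now have

$$\mathbf{A} = \sum_\lambda \left(q_\lambda \mathbf{A}_\lambda + q_\lambda^{\times} \mathbf{A}_\lambda^{\times}\right) \qquad (24)$$

where the cross denotes the conjugate complex character. If

$$Q_\lambda = q_\lambda + q_\lambda^{\times}, \qquad
  P_\lambda = -i\nu_\lambda\,(q_\lambda - q_\lambda^{\times}), \qquad (25)$$

the energy of a single wave is given by

$$H_\lambda = \tfrac{1}{2}\,(P_\lambda^2 + \nu_\lambda^2 Q_\lambda^2). \qquad (26)$$

The quantum conditions are

$$P_\lambda Q_\lambda - Q_\lambda P_\lambda = -\frac{ih}{2\pi}, \qquad
  P_\lambda Q_\mu - Q_\mu P_\lambda = 0, \ \text{etc.} \qquad (27)$$

The eigenvalues of energy are given by

$$E_\lambda = (n_\lambda + \tfrac{1}{2})\,h\nu_\lambda, \qquad (28)$$
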

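$n_\lambda$ being a non-negative integer. This produces a difficulty. In a
finite volume of radiation there are an infinite number of oscillators and hence
the total energy, being $\sum E_\lambda$, must be infinite even if the quantum
numbers $n_\lambda$ are zero. This is impossible. To avoid the difficulty we
express (26) as

$$H_\lambda = \nu_\lambda^2\,(q_\lambda q_\lambda^{\times} + q_\lambda^{\times} q_\lambda)
  = 2\nu_\lambda^2\,q_\lambda^{\times} q_\lambda. \qquad (29)$$

On the classical theory (29) is valid. From (25) and (27) we get

$$q_\lambda q_\lambda^{\times} - q_\lambda^{\times} q_\lambda = \frac{h}{4\pi\nu_\lambda} \qquad (30)$$

and hence

$$E_\lambda = n_\lambda h\nu_\lambda. \qquad (31)$$

The state of the field of radiation is given by the numbers $n_\lambda$. The
functions $\mathbf{A}_\lambda$ are of the form

$$\mathbf{A}_\lambda = (8\pi c^2)^{1/2}\,\mathbf{e}_\lambda
  \cos(\boldsymbol{\kappa}_\lambda\cdot\mathbf{r})
  \quad\text{or}\quad
  (8\pi c^2)^{1/2}\,\mathbf{e}_\lambda
  \sin(\boldsymbol{\kappa}_\lambda\cdot\mathbf{r}), \qquad (32)$$
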

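where $\boldsymbol{\kappa}_\lambda$ gives the direction of propagation and
$\mathbf{e}_\lambda$ is a unit vector which is parallel to the direction of
polarization. Since div $\mathbf{A}_\lambda$ = 0, we get

$$(\mathbf{e}_\lambda\cdot\boldsymbol{\kappa}_\lambda) = 0. \qquad (33)$$

Owing to the boundary conditions on A, $\boldsymbol{\kappa}_\lambda$ takes the
values

$$\boldsymbol{\kappa}_\lambda = \left(\frac{2\pi}{L}\,n_{\lambda x},\
  \frac{2\pi}{L}\,n_{\lambda y},\ \frac{2\pi}{L}\,n_{\lambda z}\right) \qquad (34)$$

where $n_{\lambda x}$ etc. are all integers.

$$\frac{\nu_\lambda}{c} = \frac{2\pi}{L}\left(n_{\lambda x}^2 + n_{\lambda y}^2 + n_{\lambda z}^2\right)^{1/2}. \qquad (35)$$

In the new theory the light quanta appear only through the quantum numbers
$n_\lambda$ associated with the harmonic oscillators.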
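
The remark that a finite volume contains an infinite number of oscillators can
be made concrete by counting the integer triples allowed by (34) up to a given
frequency. The sketch below is an added illustration; `count_wave_vectors` and
`frequency` are assumed names, c and L are set to unity by default, and each
wave vector counted still carries two independent polarizations.

```python
import math


def frequency(n, c=1.0, L=1.0):
    """Frequency from (35) for an integer triple n = (nx, ny, nz)."""
    nx, ny, nz = n
    return 2 * math.pi * c / L * math.sqrt(nx * nx + ny * ny + nz * nz)


def count_wave_vectors(n_max):
    """Number of non-zero integer triples with nx^2 + ny^2 + nz^2 <= n_max^2."""
    count = 0
    for nx in range(-n_max, n_max + 1):
        for ny in range(-n_max, n_max + 1):
            for nz in range(-n_max, n_max + 1):
                r2 = nx * nx + ny * ny + nz * nz
                if 0 < r2 <= n_max * n_max:
                    count += 1
    return count


for n_max in (1, 2, 4, 8):
    print(n_max, count_wave_vectors(n_max))
```

The count grows roughly as the cube of the cut-off and never terminates, which
is why the sum of the zero-point terms in (28) diverges.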

The more complicated problem of the interaction of particles with
radiation is also treated by this method. The Hamiltonian is first constructed. It
consists of the energy of the particles, the electrostatic field and the wave zone. 
Only the wave zone can be quantized since the total energy momentum for 
it can be represented by a four-vector. The electrostatic energy, which is 
represented by longitudinal waves, does not possess corpuscular properties. A 
rigorous treatment of the problem has to take account of the four different 
states in which an electron can exist corresponding to the signs of its energy 
and spin. 

If the question is asked, “ What is light, waves or particles ? ” the modern 
answer is that it is neither. Light is what it is in the interaction phenomena 
of matter and radiation. The phenomena are real but all mechanical models 
devised to explain the phenomena fail. When the frequency of radiation
exceeds the value 2mc²/h an electron pair may be created and it is not
therefore right to talk of a pure radiation field of such a high frequency. If one
insists upon getting some sort of definition of light the answer may be
given that light is an ethereal orchestra of harmonic oscillators conducted by
matter according to quantum rules. But such definitions serve no important
purpose in science. Those who know the mathematical theory can do
without the definition which is not precise enough. Those who do not
understand the theory will not be enlightened by the definition because it is too
technical.
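
The threshold 2mc²/h for pair creation can be put into numbers. The sketch
below is an added illustration; the constants are present-day standard values
supplied as assumptions, not figures from the lectures, and the function name
is hypothetical.

```python
PLANCK_H = 6.62607015e-34        # J s (assumed standard value)
LIGHT_SPEED = 299792458.0        # m / s (assumed standard value)
ELECTRON_MASS = 9.1093837015e-31 # kg (assumed standard value)
JOULE_PER_MEV = 1.602176634e-13


def pair_threshold_frequency():
    """Frequency above which a quantum carries the rest energy of two electrons."""
    return 2 * ELECTRON_MASS * LIGHT_SPEED ** 2 / PLANCK_H


nu = pair_threshold_frequency()
energy_mev = PLANCK_H * nu / JOULE_PER_MEV

print(f"threshold frequency = {nu:.3e} per second")
print(f"quantum energy      = {energy_mev:.3f} MeV")
```

The threshold lies near 2.5 x 10^20 per second, that is, at a quantum energy
of about one million electron-volts, well inside the γ-ray region.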

The quantum theory of radiation must be looked upon as logically neces- 
sary for quantum mechanics to be a complete theory. Consider for example 
a thought-experiment illustrating the uncertainty principle on which quantum 
mechanics is based. Suppose that the position of an electron is observed 
through a γ-ray microscope. The uncertainty of position is of the order of
the wave length. If the energy of light were transferred continuously by
waves the momentum of the beam which kicks the electron before reaching
the eye could be made appreciably small by regulating the incidence of light.
In that case the uncertainty relation Δp Δq ~ h/2π would not be valid. The
wave theory of light is not therefore consistent with the uncertainty principle.


3. PHENOMENOLOGICAL SUCCESS 


We will describe in this section certain phenomena in the explanation of 
which the quantum theory of radiation has attained a good measure of suc- 
cess. The task will confront us with questions relating to the transition pro- 
babilities and hence we consider the general perturbation theory first. 

Suppose that the Hamiltonian of a quantum system can be split up into
two parts so that it takes the form

$$H_0 + H \tag{36}$$

where $H$ is small compared to $H_0$ and the eigenfunctions $\psi_n$ and eigenvalues
$E_n$ corresponding to the latter are known. Thus

$$H_0 \psi_n = E_n \psi_n. \tag{37}$$

We proceed to solve the equation

$$(H_0 + H)\,\psi = \frac{ih}{2\pi}\,\frac{\partial \psi}{\partial t} \tag{38}$$

with the assumption that $\psi$ is of the form

$$\psi = \sum_n b_n(t)\, \psi_n\, e^{-2\pi i E_n t/h}. \tag{39}$$

The complication caused by degenerate states will not be discussed here.
Substituting for $\psi$ in (38) and using (37) we have

$$\frac{ih}{2\pi} \sum_n \dot b_n\, \psi_n\, e^{-2\pi i E_n t/h} = \sum_n b_n(t)\, H \psi_n\, e^{-2\pi i E_n t/h}. \tag{40}$$

Multiply both sides by $\psi_m^*$ and integrate over the co-ordinate space to get, on
using the conditions of orthogonality and normalization,

$$\frac{ih}{2\pi}\, \dot b_m = \sum_n b_n H_{nm}\, e^{2\pi i t (E_m - E_n)/h}. \tag{41}$$

A rigorous solution of (41) is not an easy matter. On the assumption that $H$
is small and that the system was initially in the unperturbed state $\psi_n$ given by

$$b_n(0) = 1, \qquad b_m(0) = 0, \quad m \neq n,$$

we have

$$b_m(t) = H_{nm}\, \frac{e^{-2\pi i t (E_n - E_m)/h} - 1}{E_n - E_m}$$

where

$$H_{nm} = \int \psi_m^*\, H\, \psi_n \, dx.$$


The probability that the system which was initially in the state $\psi_n$ will be in
the state $\psi_m$ at time $t$ is

$$|b_m(t)|^2 = \frac{2\,|H_{nm}|^2}{(E_n - E_m)^2} \left[ 1 - \cos \frac{2\pi (E_n - E_m)\, t}{h} \right].$$
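The first-order result can be checked numerically. The following Python sketch is only an illustration under stated assumptions: the matrix element is taken as a constant real number, units are chosen so that $h = 2\pi$, and the function names and numerical values are hypothetical. It integrates the first-order form of (41) by the midpoint rule and compares $|b_m|^2$ with the closed formula.

```python
import cmath
import math

H_PLANCK = 2.0 * math.pi  # assumption: illustrative units in which h / 2 pi = 1


def prob_closed_form(h_nm, e_n, e_m, t, h=H_PLANCK):
    """First-order |b_m(t)|^2 = 2 |H_nm|^2 [1 - cos(2 pi (E_n - E_m) t / h)] / (E_n - E_m)^2."""
    d = e_n - e_m
    return 2.0 * abs(h_nm) ** 2 * (1.0 - math.cos(2.0 * math.pi * d * t / h)) / d ** 2


def prob_numeric(h_nm, e_n, e_m, t, steps=20000, h=H_PLANCK):
    """Integrate (i h / 2 pi) db_m/dt = H_nm exp(2 pi i t (E_m - E_n) / h), with b_n = 1."""
    dt = t / steps
    b = 0.0 + 0.0j
    for k in range(steps):
        t_mid = (k + 0.5) * dt  # midpoint of the k-th time step
        rhs = h_nm * cmath.exp(2j * math.pi * t_mid * (e_m - e_n) / h)
        b += (2.0 * math.pi / (1j * h)) * rhs * dt
    return abs(b) ** 2


if __name__ == "__main__":
    # Hypothetical values: weak coupling 0.01, level spacing 0.3, elapsed time 5.
    print(prob_closed_form(0.01, 1.0, 1.3, 5.0))
    print(prob_numeric(0.01, 1.0, 1.3, 5.0))
```

The two printed numbers should agree closely as long as the coupling is small compared with the level spacing, which is the condition under which the first-order treatment is meaningful.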


Sometimes no direct transition from $\psi_n$ to $\psi_m$ is possible as $H_{nm} = 0$. It is
however possible that the two states are connected through intermediate
states $\psi_p$. We have then to find out the probability of the transition from
$\psi_n$ to $\psi_m$ by considering all intermediate transitions of the form $\psi_n \to \psi_p$ and
$\psi_p \to \psi_m$. For this (41) is split up into two equations:

$$\frac{ih}{2\pi}\, \dot b_p = H_{np}\, e^{-2\pi i t (E_n - E_p)/h}, \tag{41a}$$

and

$$\frac{ih}{2\pi}\, \dot b_m = \sum_p b_p\, H_{pm}\, e^{-2\pi i t (E_p - E_m)/h}. \tag{41b}$$

When the state $\psi_m$ belongs to a series of continuous states let $\rho_E\, dE$ denote
the number of states with energy between $E$ and $E + dE$. The probability
of transition to a state lying in an interval of energy $\Delta E$ is, on replacing
$E_m$ by $E$,

$$\int_{\Delta E} |b_m|^2\, \rho_E\, dE = \frac{4\pi^2 t}{h}\, \rho_E\, |H_{nm}|^2. \tag{42}$$

Hence the transition probability per unit time is

$$\frac{4\pi^2}{h}\, \rho_E\, |H_{nm}|^2. \tag{43}$$
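The step from the oscillating probability to a rate proportional to $t$ rests on the integral of $(1 - \cos ax)/x^2$ over all $x$, which equals $\pi a$ for $a > 0$; here $x$ stands for $E - E_n$ and $a$ for $2\pi t/h$. The sketch below is a rough numerical check of that integral, with a finite cut-off and step count chosen purely for illustration, followed by the resulting rate under the assumption that $\rho_E$ and $|H_{nm}|$ are constant over the interval. The function names are hypothetical.

```python
import math


def one_minus_cos_area(a, half_width=2000.0, steps=400000):
    """Midpoint-rule estimate of the integral of (1 - cos(a x)) / x^2 over [-L, L]."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps):
        x = -half_width + (k + 0.5) * dx  # midpoints never fall exactly on x = 0
        total += (1.0 - math.cos(a * x)) / (x * x) * dx
    return total


def transition_rate(h_nm, rho, h):
    """Transition probability per unit time, (4 pi^2 / h) rho |H_nm|^2."""
    return 4.0 * math.pi ** 2 * rho * abs(h_nm) ** 2 / h


if __name__ == "__main__":
    # The truncated integral falls short of pi by roughly 2 / half_width.
    print(one_minus_cos_area(1.0), math.pi)
    # Illustrative units with h = 2 pi, unit matrix element and unit level density.
    print(transition_rate(1.0, 1.0, 2.0 * math.pi))
```

Multiplying the integral $\pi a = 2\pi^2 t/h$ by the prefactor $2|H_{nm}|^2 \rho_E$ of the first-order probability gives the linear growth in $t$, and dividing by $t$ gives the rate.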

The problem of natural line breadth has been solved by using this result
on transition probabilities and we give therefore an account of it here.

Spectral lines have a natural breadth as otherwise they would not be
observable. According to Bohr’s theory the breadth is the same for all lines.
This disagrees with observation. The new theory provides an explanation
of the varying natural breadth of lines. The classical treatment attributes
this phenomenon to the electronic self-force. A calculation of the self-force
for an oscillating electron gives the more rigorous equation,

$$\ddot x = -\nu_0^2\, x - \Gamma\, \dot x \tag{44}$$

where $\Gamma = \tfrac{2}{3}\, e^2 \nu_0^2 / mc^3$.

If we assume that

$$\Gamma \ll \nu_0$$

the solution of (44) may be written down as

$$x = x_0\, e^{-\Gamma t/2} \cos \nu_0 t.$$

An approximate estimate of the energy is

$$U = \tfrac{1}{2}\, m \left( \dot x^2 + \nu_0^2 x^2 \right) = \tfrac{1}{2}\, m\, \nu_0^2\, x_0^2\, e^{-\Gamma t} = U_0\, e^{-\Gamma t}.$$

By convention $1/\Gamma$ is taken as the average lifetime of an oscillator. In this
time the energy diminishes in the ratio $e : 1$. Moreover, as

$$\int_0^\infty U_0\, e^{-\Gamma t}\, dt = U_0 \cdot \frac{1}{\Gamma}$$

the total action of the oscillator is equivalent to the action of a simple harmonic
oscillator of energy $U_0$ and lifetime $1/\Gamma$. The electrical force vector
corresponding to (9b) is the real part of

$$E = E_0\, e^{-\Gamma t/2}\, e^{i\nu_0 t} = \int_{-\infty}^{\infty} E(\nu)\, e^{i\nu t}\, d\nu,$$

$$E(\nu) = \frac{E_0}{2\pi} \int_0^\infty e^{-\Gamma t/2}\, e^{i(\nu_0 - \nu)t}\, dt = -\frac{E_0}{2\pi}\, \frac{1}{i(\nu_0 - \nu) - \Gamma/2}.$$

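The intensity distribution is given by¹

$$I(\nu) \propto |E(\nu)|^2 = I_0\, \frac{\Gamma}{2\pi}\, \frac{1}{(\nu - \nu_0)^2 + \Gamma^2/4}$$

so that

$$\int_{-\infty}^{\infty} I(\nu)\, d\nu = I_0.$$

Over the range $\nu - \nu_0 = \pm \Gamma/2$ the intensity is not less than half the maximum
value. Hence $\Gamma$ is known as the half-maximum breadth. Since
$\Delta\nu_0 = \Gamma$ by (45)

$$\Delta\lambda = \frac{2\pi c}{\nu_0^2}\, \Delta\nu_0 = \frac{4\pi}{3}\, \frac{e^2}{mc^2} = 1.17 \times 10^{-4}\ \text{Å}. \tag{51}$$

Thus the broadening in wave length is found to be of the order of $10^{-12}$ cm.
and independent of the wave length.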
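The numerical value in (51) can be reproduced in a few lines. The sketch below assumes the classical damping constant $\Gamma = \tfrac{2}{3} e^2 \nu_0^2/mc^3$ with $\nu_0$ an angular frequency, so that $\Delta\lambda = (4\pi/3)\, e^2/mc^2$. The value used for $e^2/mc^2$ is a modern one and is an assumption of this example, which presumably explains the small difference from the figure quoted in the text.

```python
import math

# Assumption: modern value of the classical electron radius e^2 / (m c^2), in cm.
R0_CM = 2.8179403e-13
CM_PER_ANGSTROM = 1.0e-8


def natural_breadth_cm(r0_cm=R0_CM):
    """Classical natural breadth in wave length, (4 pi / 3) e^2 / (m c^2)."""
    return 4.0 * math.pi / 3.0 * r0_cm


def natural_breadth_angstrom(r0_cm=R0_CM):
    """The same breadth expressed in angstrom units."""
    return natural_breadth_cm(r0_cm) / CM_PER_ANGSTROM


if __name__ == "__main__":
    print(natural_breadth_cm())        # of the order of 1e-12 cm
    print(natural_breadth_angstrom())  # close to 1.2e-4 angstrom
```

No wave length enters the function at all, which is the point of the remark that the classical breadth is the same for every line.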

The relation between the lifetime and the half-maximum breadth follows
from the uncertainty relation,

$$\Delta E\, \Delta t \sim h/2\pi, \tag{52}$$

of quantum mechanics. If $\Delta t$ is the average lifetime of a state only so much
time is available for finding its energy. Hence the uncertainty in the energy
determined is

$$\Delta E = \frac{h}{2\pi}\, \frac{1}{\Delta t} = \frac{h\Gamma}{2\pi}. \tag{53}$$


To determine $\Gamma$ we proceed with a Hamiltonian for the joint system of atoms
and radiation, arrive at the equation (41) and then, from the initial conditions
get the form of $b_n(t)$. Let $b_n(0) = 1$ and $b_n(\infty) = 0$. Then the average
lifetime is

$$\frac{1}{\Gamma} = -\int_0^\infty t\, d|b_n|^2. \tag{54}$$

The lifetime obviously depends upon the transition probabilities and the exact
relation between the lifetime of a state and the probabilities per unit time
$W_{nm}$ for transitions from $\psi_n$ to lower levels $\psi_m$ is

$$\Gamma = \sum_m W_{nm}. \tag{55}$$


This gives us the thickness associated with each level,

$$\Delta\nu_n = \frac{1}{2\pi} \sum_m W_{nm}. \tag{56}$$

The line breadth of an emitted line, $n \to n'$, is

$$\Delta\nu_n + \Delta\nu_{n'}. \tag{57}$$
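The chain from transition probabilities to line breadth can be sketched as follows. The example assumes that the total decay constant of a level is the sum of its transition probabilities per unit time, that the lifetime is its reciprocal, and that the energy width of the level is $h\Gamma/2\pi$ as in (53). The rates used are invented for illustration and the function names are hypothetical.

```python
import math

H_ERG_S = 6.62607015e-27  # Planck's constant in erg s


def decay_constant(rates):
    """Gamma for a level: sum of transition probabilities per unit time to lower levels."""
    return sum(rates)


def lifetime(rates):
    """Average lifetime 1 / Gamma of the level, in seconds."""
    return 1.0 / decay_constant(rates)


def energy_width(rates, h=H_ERG_S):
    """Energy uncertainty h Gamma / (2 pi) of the level, in erg."""
    return h * decay_constant(rates) / (2.0 * math.pi)


def line_energy_breadth(upper_rates, lower_rates, h=H_ERG_S):
    """Breadth of the emitted line: the widths of the two levels add."""
    return energy_width(upper_rates, h) + energy_width(lower_rates, h)


if __name__ == "__main__":
    upper = [6.0e7, 2.0e7]  # hypothetical rates (per second) out of the upper level
    lower = [1.0e7]         # hypothetical rate out of the lower level
    print(lifetime(upper))
    print(energy_width(upper))
    print(line_energy_breadth(upper, lower))
```

Because the width of the lower level enters as well, a line with a small transition probability of its own can still be broad if either of its levels decays rapidly through other channels.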

It is possible on the new theory even for a weak line to be broad. Moreover,
the breadth varies from line to line. The shape of the absorption line is
the same as that of the emission line if there is no appreciable variation in
the intensity of the incident line in the region of the line breadth. It may
be mentioned in passing that several other factors exist which considerably
influence the shape of the line³; for instance, the Doppler effect and collisions
with other atoms and molecules.

An excited atom, left to itself, may emit radiation and attain a state of
lower energy. Such spontaneous emission could not be explained by the
correspondence principle and the classical theory. Induced emission in the field
of atoms and radiation could be explained without any reference to the
quantum theory of radiation as Einstein first did. The problem of spontaneous
emission is solved along with the other problems of the interaction
of matter and radiation by constructing a quantum Hamiltonian for the joint
system of matter and radiation. Dirac’s relativistic treatment of the electron
gives for the joint system the Hamiltonian

$$H_0 + H$$

where

$$H_0 = \sum_k \left[ (\alpha_k,\; p_k - e_k A) + \beta_k\, \mu + e_k\, \phi \right]. \tag{58}$$

$A$ represents the vector potential of the external field and the radiation, $\phi$
the scalar potential of the external field and $\alpha_k$, $\beta_k$ the Dirac matrices for
the $k$th particle. The pure radiation field is described by the term,


The probability of emission is given by the formula (43) of which the
term $|H_{nm}|^2$ can be shown to have a factor $1 + n_\nu$ where the suffix is

$$\nu = \frac{E_n - E_m}{h}.$$

Hence even if $n_\nu = 0$, that is, when there is no radiation of frequency $\nu$ present
in the field, emission of that frequency can occur. This is the explanation
of spontaneous emission.

Recent years have brought to light some curious phenomena of γ-rays
which are typical quantum effects. The first is that of multiple processes.
Suppose that a light quantum whose momentum is $k_0$ is scattered by an
electron at rest with energy $\mu = mc^2$ and that the scattered radiation consists
of two quanta with momenta $k_1$ and $k_2$. If $p$ is the momentum imparted to
the electron and $E$ is its energy after the impact, the laws of conservation
give

$$k_0 = k_1 + k_2 + p,$$

$$k_0 + \mu = k_1 + k_2 + E.$$

The final stage denoted by $F(k_1, k_2, p)$ can be reached through six different
intermediate stages. The stages may be described in pairs as follows:

(1) $k_0$ absorbed. The electron has a momentum $p' = k_0$ and (2) $k_1$ is
emitted. The electron has a momentum $p'' = k_0 - k_1$. When $k_2$ is emitted
the final stage is reached. But we could also have as intermediate stages

    (1) $k_0$ absorbed with electron momentum $p'$,
and (3) $k_2$ emitted with electron momentum $p''' = k_0 - k_2$,

or  (4) $k_1$ emitted with the electron momentum $-k_1$,
and (5) $k_2$ emitted with the electron momentum $-k_1 - k_2$,

or  (6) $k_2$ emitted with the electron momentum $-k_2$.
The transition probabilities have to be calculated taking note of all these
transitions. For $k_0 \ll \mu$ the double process is extremely rare. For $k_0 \gg \mu$
the probability of this process is $1/137$ that of the single process. Processes
are also possible scattering one impinging photon into $p$ photons but their
probabilities are extremely small. One witnesses multiple processes in the
interaction of cosmic rays with matter.

The creation of electron pairs and also the annihilation which may sometimes
give rise to Bremsstrahlung (continuous X-ray emission) are features
of the interaction of hard γ-rays with matter. When a γ-ray quantum with
energy exceeding $2mc^2$ interacts with an electron in a negative-energy state
the latter may rise to a state of positive energy. In this process both the
energy and the momentum are conserved only if a third particle participates
in the interaction. The reverse process, that of annihilation, takes place when
an electron with positive energy has momentum equal and opposite to that of
a positron. The energy is released in the form of two quanta with equal and
opposite momenta. The annihilation may lead to one quantum of radiation
if the electron with positive energy is bound to a nucleus and it may also
happen that the energy released in annihilation is absorbed in an atom.
When hard cosmic rays interact with the atmospheric matter electron pairs
are created by pairs of scattered photons with combined energy exceeding
$2mc^2$ which increases the scattering of the rays. We have here the phenomenon
of light scattered by light about which we will have to say something
in the next section. A fast electron moving in the field of an atom gets
deflected and the energy lost is emitted as Bremsstrahlung. The latter may
consist of one or more quanta. The calculation of transition probabilities
of these processes goes far beyond the scope of the older theories of radiation
and the correspondence principle.

Cosmic-ray studies and nuclear physics have revealed phenomena which
bear out the new theory of light, the hole theory of the positron and the
present treatment of the interaction of matter and radiation. But our refined
optical instruments have also brought us face to face with situations
which question the soundness of the logic of the new theory. In the following
section, we will deal with the difficulties which the theorist has to tackle
in this field.

4. PECULIAR FEATURES AND DIFFICULTIES 

The difficulties that are inherent in the nature of an electron persist even
in the quantum theory. Although the electron is treated as a point charge
nothing is known about the laws that hold good within a distance of $10^{-13}$ cm.
of the charge. It is not proper to treat the electron as a mathematically
abstract point within a micro-microscopic neighbourhood of itself although
that would be justifiable in problems where inter-electronic distances of the
order of 1 Å are involved. A certain sphere of influence of radius of the
order of $10^{-11}$ cm. is usually identified with the electron. The quantum laws
do not hold good here. In fact they fail just outside this region also. So the
problems of electronic mass and self-energy still remain unexplained. But the
matter does not end here. The difficulties are aggravated near the edge of
the electron by those very considerations which are responsible for the success
of the quantum theory at greater distances. The interaction energy of
matter and radiation has the average value zero to the first approximation.
But if the second order terms are retained in the interaction of an electron
with radiation and the average value is determined it is found to be

"" “ 13^? / ,<,v " ^ / , ‘ (v * (S91 

0 0 

This is known as the transverse self-energy of the electron as it is due to the 
interaction with light waves. Why this term is called self-energy should be 
made clear because it represents a typical quantum effect. 

The field of radiation is characterised at each point by an observable 
N whose eigenvalues are the quantum numbers for the orchestra of fictitious 
oscillators. N does not commute with the electric force vector E. Hence 
the uncertainty relation 

&N&E E 

holds good. Suppose that there are no photons in the field so that $N = 0$
and $E = 0$. But the uncertainty relation says that even in such a case there
may be fluctuations $\Delta E$ such that the average value of $E$ is zero. The
interaction of the electron with these fluctuations is responsible for the
transverse self-energy when the field of radiation is absent. When $h = 0$,
$W = 0$ and hence the origin of $W$ is to be traced to the quantum hypothesis.

The second approximation fails for precisely the same kind of physical
reasons for which the classical theory breaks down when the condition

$$\lambda \gg r_0 = e^2/mc^2 \tag{60}$$

is violated. One interpretation of this condition is already given. It has
to be satisfied for the validity of the expression of the self-force used in (44).
Further (46) is practically equivalent to (60) since the actual statement of
the former reduces to

$$\lambda \gg \tfrac{2}{3}\, r_0. \tag{61}$$

In the form in which (46) is stated the condition means that the life of an
oscillator must be large compared to the period of oscillation as otherwise
it would be meaningless to talk about an oscillator. For similar reasons the
condition $t \gg h/E_0$ has to be fulfilled in connection with (42) if the existence
of stationary states is to have a meaning. $h/E_0$ gives the period of revolution
of the electron.

The interaction with atoms of electrons with energies of the order of
$100\,mc^2$ or more is inferred from cosmic-ray photographs. The classical theory
breaks down for electrons of energies exceeding $137\,mc^2$ as the condition (60)
is no longer valid. The quantum theory gives an exceedingly high loss of
energy by radiation for the interaction of electrons having $E > 150\,mc^2$. The
theory is in good agreement with experimental results for $E < 150\,mc^2$.

As we have already remarked above electron pairs are created in a field
of radiation with

$$\lambda < \lambda_0 = h/2mc. \tag{62}$$

This limit on quantum electrodynamics in vacuo is much more stringent than
the limit (60) on the classical theory. What is most interesting about this
limit is that it vanishes if we put $h = 0$ or take the velocity of light as infinite.
It is interesting to note that the limit $\lambda_0$ is independent of the electronic charge.
Incidentally, consider a wave packet of light or of a single electron; when it
is of an extension smaller than $\lambda_0$ its Fourier expansion will involve waves
corresponding to states of energy exceeding $2mc^2$; that implies the existence
of an indefinite number of electron pairs. It is hazardous therefore to attempt
to draw conclusions from wave packets of extension less than $\lambda_0$.
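To see how much more stringent the pair-creation limit $h/2mc$ is than the classical limit $e^2/mc^2$, the two lengths can be evaluated directly. The sketch below uses modern CGS values of the constants, which are assumptions of this example and not figures taken from the text. The ratio of the two lengths is $hc/2e^2$, that is, $\pi$ times the number 137.

```python
import math

# Assumed modern CGS (Gaussian) values of the constants.
H_ERG_S = 6.62607015e-27    # Planck's constant, erg s
M_E_G = 9.1093837e-28       # electron mass, g
C_CM_S = 2.99792458e10      # velocity of light, cm / s
E_ESU = 4.80320471e-10      # electronic charge, e.s.u.


def pair_limit_cm():
    """lambda_0 = h / (2 m c): wave length below which pairs may be created."""
    return H_ERG_S / (2.0 * M_E_G * C_CM_S)


def classical_radius_cm():
    """r_0 = e^2 / (m c^2): limit of validity of the classical theory."""
    return E_ESU ** 2 / (M_E_G * C_CM_S ** 2)


if __name__ == "__main__":
    lam0 = pair_limit_cm()
    r0 = classical_radius_cm()
    print(lam0, r0, lam0 / r0)
```

The first length contains no charge and the second contains no $h$, so their ratio is a pure number of the order of several hundred; the quantum limit is therefore reached long before the classical one.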

The properties of the positive electron and its interaction with radiation 
are not yet clearly understood. We have already discussed elsewhere the 
theory of the positron and we will content ourselves merely with the remark 
that the hole theory gives correct results either where first order effects are 
concerned or when the interacting field is very weak. 

A light quantum has an infinite self-energy. This is due to the fact that 
its identification with itself can be accomplished through intermediate stages 
of multiple processes involving the creation of an infinite number of pairs. 
The hole theory itself can in general be applied only when a finite number 
of pairs is created and the probability of creation is very small. 

After a critical survey of these difficulties Heitler² concludes: “The
present quantum electrodynamics gives correct results if applied only to the
first approximation in the interaction of light with both a single free electron
and with the infinite sea of electrons in negative energy states. Nevertheless,
we cannot in general hope to obtain correct results for light in which wave
lengths $\lambda \sim r_0$ occur”.

Further research is necessary before we understand how the electronic
radius $r_0$ or $e^2/mc^2$ is independent of $h$. In the mystery of this apparent
independence of $r_0$ and $h$ probably lies the root of future theoretical developments.
It is gratifying to note that in spite of many revolutions in thought
the principles of momentum and energy are still applied to individual processes.
One naturally hopes to count on them in the spade-work that lies ahead in
this field of phenomena.

5. THE POSITIVE ELECTRON

On theoretical grounds the existence of the positron or the positive
electron was anticipated two or three years before it was actually discovered
by Anderson in a cosmic-ray photograph.⁴ The relativistic formula for the
energy of an electron in free space is

$$E = \pm\, c \sqrt{p^2 + m^2 c^2}$$

in the usual notation. Negative values of energy have therefore to be admitted
in the relativistic treatment of an electron. The possible values of $E$
are $E_+$ or $E_-$ such that

$$E_+ \geq mc^2, \qquad E_- \leq -mc^2.$$

The probability of a transition from $E_+$ to $E_-$ is finite. Actually the transition
may occur with an emission of radiation or as a result of collision. It
is necessary, therefore, to understand clearly the physical significance of
electrons in negative energy states.
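The two branches of the relativistic energy, $E = \pm c\sqrt{p^2 + m^2c^2}$, and the gap of $2mc^2$ between them can be illustrated with a short sketch. Units with $m = c = 1$ are an assumption made for convenience, and the function names are hypothetical.

```python
import math


def energy_branches(p, m=1.0, c=1.0):
    """Return (E_plus, E_minus) = (+c sqrt(p^2 + m^2 c^2), -c sqrt(p^2 + m^2 c^2))."""
    e = c * math.sqrt(p * p + m * m * c * c)
    return e, -e


def gap(p, m=1.0, c=1.0):
    """Energy needed to lift an electron of momentum p from E_minus to E_plus."""
    e_plus, e_minus = energy_branches(p, m, c)
    return e_plus - e_minus


if __name__ == "__main__":
    for p in (0.0, 0.75, 2.0):
        print(p, energy_branches(p), gap(p))
```

The gap is smallest at $p = 0$, where it equals $2mc^2$; this is the least energy that must be supplied before a transition between the two branches becomes possible.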

The most obvious interpretation of an electron in the state $E_-$ suggests
properties which have never been observed of electrons. For instance an electron
with energy $E_-$ and another with energy $E_+$ would repel each other according
to the classical theory. But the force of repulsion on the former would induce
in it an acceleration as if it were subject to an attraction and in the latter, the
usual effect. The mischief is caused by the apparent change in the sign of
the mass of the electron with negative energy. It is only when the mass is
positive that the acceleration is in the same sense as the external force. The
interpretation of the state $E_-$ on Dirac’s hole theory avoids the difficulty caused
by a negative mass. But the particles whose properties were discussed on
this theory were unknown in 1930 and Dirac’s first suggestion that they might
be protons could not be seriously entertained. Although Dirac’s wave equation
giving the magnetic moment of the electron appeared mathematically
satisfactory in every way there still remained the unsatisfactory feature of
negative energy states which was finally removed by the hole theory when
the positron was discovered. Of course, the latter theory has its own logical
difficulties today. The following are the two assumptions on which it is
based:

(1) Ordinarily, that is, in the absence of an external field all negative energy
states are occupied. The electrons occupying the states partly constitute, so
to say, the substratum of the universe in which the electrons move. Hence,
normally, by the Pauli principle, it is not possible for an electron to jump to
a level of negative energy.

(2) The field laws of classical electrodynamics and mechanics are supposed
to represent the full effect of the substratum inasmuch as the electrons
occupying the negative energy levels do not disturb the classical calculations
of the force, energy and momentum. If, however, an electron leaves one of
the so-occupied levels and climbs to a state of positive energy, the resulting
deficiency or hole in the substratum reveals itself in respect of charge,
momentum and energy.

Suppose that an electron with energy $E_-$ and momentum $p$ leaves one of
the occupied levels. The absence of the electron must make itself felt as a
deficiency in the effect of the substratum. By the second assumption this
deficiency or hole in the occupied zone gives to the external field a charge $e$
if $-e$ is that of the electron, a momentum $-p$ and an energy $|E_-|$. The
hole is therefore interpreted as a positive electron with charge $e$, momentum
$-p$ and energy $|E_-|$. The creation of the hole or the positive electron is
obviously accompanied by the appearance of a new electron, the one that had
originally filled the hole, in one of the states of positive energy. An electron
pair thus comes into existence simultaneously. Similarly when an electron
acquires charge, momentum and energy, exactly what is required to fill the
hole, the electron pair may be annihilated.

The arguments regarding the creation or annihilation are valid only if a
finite number of electron pairs are considered. The theory of the positron
is not valid when an infinite number of such pairs are supposed to be featuring
in any phenomenon. For the very laws of mechanics and electrodynamics
on which the arguments depend break down when the substratum is
appreciably disturbed by the existence of an infinite number of holes. In
the first place it is not logical to assume that an external field of the classical
type can disturb the substratum itself to the extent of producing a hole in it.
A radiation of energy exceeding $2mc^2$ is needed for disturbing the substratum.
The theory of radiations of this order of energy, that is, of wave lengths less
than $h/2mc$ cannot be rigorously treated on classical assumptions because the
latter are intimately related to the possibility of all negative energy states
being occupied. In other words, the two assumptions of the hole theory serve
the purpose of only reconciling the negative energy states with classical
mechanics and electrodynamics; and the supposed mechanism of the creation
and annihilation shows that the assumptions are plausible to a limited extent
beyond which range phenomena transcend the classical theory. The difficulty
of $E_-$ states is closely bound up with those of radiation of high energies.
Until they are solved one cannot determine the interaction between a strong
field and the positrons. For weak fields the results of interaction obtained
from the classical theory are true, at least, qualitatively. One such elementary
qualitative result which was responsible for the discovery of the positron
is that positive and negative electrons develop opposite curvatures in an
external magnetic field.

The first serious break in the classical theory appeared when Dirac formulated
the laws of quantum mechanics in the old symbolic form of Poisson
brackets but with the refinement of non-commutative operations, heralding
a new philosophy of the states and observables of systems in harmony with
experimental limitations. A further advance was achieved when the relativistic
wave equation of the electron was cast into a linear form so that four
distinct states became possible by virtue of the different signs of the spin and
energy. It is significant that Dirac’s relativistic equation can be expressed
as equivalent in form to the Maxwell equations of electrodynamics, because
from this stage onwards a joint treatment of matter and radiation becomes
necessary. The first important phase of the theory of radiation concludes
with the second quantization which, in substance, has been described by the
conditions (27). The difficulties of high energy in this theory and those
of negative energy states in the other can be satisfactorily removed only in a
unified treatment of matter and radiation.

The developments outlined by us have been marked by an unshaken faith
in the classical principles of conservation for matter and radiation. Whenever
we are concerned with a partial knowledge of both kinematical and
dynamical features of a system wave mechanics answers questions in terms
of probability and the phenomena are governed by statistical laws. But
deterministic laws regarding the energy and momentum of a single particle
still hold good provided no space-time description is contemplated. The concepts
of the neutron and the neutrino and our explanation of various nuclear
processes are characterized by the success of the conservation principles for
particles. If the principles fail, as Bohr once suggested in connection with the
uneven β-rays, it will only mean that we cannot isolate the particle in question.
In the future theoretical developments of the micro-microscopic phenomena
conservation principles for isolated particles are bound to be replaced by
field laws. Dimensionless numbers such as $hc/2\pi e^2$, which Eddington has
tried to explain, may possibly be understood in fundamental terms only when
the developments have occurred.

IX QUANTUM THEORIES OF FORCES

1. The Nature of Chemical Forces. 2. Collectivization of Electrons. 3. Some 
Typical Methods of Calculation. 4. Other Forces of Quantum-Mechanical Nature. 

1. THE NATURE OF CHEMICAL FORCES 

The concept of valence¹ is introduced to account for the whole number
relationship of chemical combinations. It has been most successfully used
in organic chemistry. The forces of valence are essentially electrical and
hence a wave mechanical theory of electrons in atoms must be
competent to explain quantitatively and qualitatively various questions
regarding the formations of molecules. One simple criterion decides
as to whether a given number of atoms can combine to form
a stable molecule or not. If the total energy of the molecule is less
than the sum of the free-space energies of the constituent atoms a stable
molecule is formed and the energy that is released as the molecule comes into
existence is called the heat of formation or molecular heat. The criterion
furnishes a very simple explanation of complicated processes such as
polymerization, oxidation and condensation and of unusual compounds like BeH.
A treatment of the n-body problem can answer therefore all questions regarding
valence including anomalous and directed valence, homopolar bonds and
activation energies. But the rigorous treatment of the n-body problem has
difficulties practically insurmountable and one has to resort to crude
approximations. Hence the results are of a qualitative nature only. The task of
approximation is considerably lightened by the fact that the electrons of the
closed inner shells and subshells can be regarded as virtually absent if the
method of the self-consistent field is adopted. When quantitative results are
desired the inner electrons prove to be a nightmare.

On this view all valence problems are really problems of adjustment of a
joint family of atoms. Usually, for any molecule, the adjustment has two
principal features. Firstly, the nuclei of participating atoms assume some
definite relative positions. Thus the hydrogen nuclei of H₂O are at equal
distances from the oxygen nucleus where an angle of about 106° is subtended.
Similarly, the spatial arrangement of the nuclei of CH₄ presents a tetrahedral
character. Secondly, the electrons in the outermost open orbits of the
atom which are called free or valence electrons are redistributed about
the different nuclei in accordance with the Fermi-Dirac statistics. The effect
of this is that some of the valence electrons are elevated to higher energy
levels, some go to lower levels while the others remain more or less undisturbed.
The electrons that attain a higher level do not favour the formation
of a molecule and are therefore called anti-bonding. Those that occupy levels
of lower energy are known as bonding electrons. The rest are non-bonding.
The inner electrons are generally non-bonding but there are exceptions. In
the case of copper, silver and gold one or two inner electrons become bonding
and hence we have all of them divalent and the third also trivalent. There
is no longer any meaning in the abstract to the question, “What is the
valency of iron or copper?” For the valency, that is, the number of bonds
depends upon the other atom or atoms which participate in a chemical
combination with it. This is the reason why oxides and hydrides reveal different
valencies of elements. Herzberg has given the following semi-empirical rule
for valency bonds:

No. of bonds = ½ (No. of bonding electrons − No. of anti-bonding electrons).     (1)

The rule has been verified in several cases. In the case of nitrogen there are
ten outer electrons for two atoms; two are anti-bonding and eight bonding.
Thus (1) explains the triple bond of nitrogen, N≡N.
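Herzberg's rule is simple enough to state as a one-line function. The sketch below merely restates the rule as half the difference between the numbers of bonding and anti-bonding electrons and applies it to the nitrogen count given in the text; the function name is hypothetical.

```python
def number_of_bonds(bonding, anti_bonding):
    """Herzberg's semi-empirical rule: half the excess of bonding over anti-bonding electrons."""
    return 0.5 * (bonding - anti_bonding)


if __name__ == "__main__":
    # Nitrogen: ten outer electrons for two atoms, eight bonding and two anti-bonding.
    print(number_of_bonds(8, 2))
    # A single bonding electron with no anti-bonding partner gives half a bond.
    print(number_of_bonds(1, 0))
```

The second call shows that the rule allows half-integral values, as for a bond maintained by one electron only.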

If $E$ is the energy of an atom with $n$ electrons it is generally supposed to
be expressible in the form

$$E = E_0 + \sum_{r=1}^{n} E_r \tag{2}$$

where $E_0$ is the energy of the nucleus and $E_r$, that of the $r$th electron. This
assumption cannot be rigorously true as the interaction energy is distributed
over the terms of the right-hand side in (2). If the $n$th electron is removed
the energy of the ionized atom is not just $(E - E_n)$; nor does the energy
of the $r$th electron remain $E_r$. But it may be possible to express the new
energy as

$$E' = E'_0 + \sum_{r=1}^{n-1} E'_r \tag{3}$$

where all the symbols are new as they represent a redistribution of the interaction
energy. The energy of an electron in the atom is thus a fictitious concept
introduced for the sake of the simplicity and convenience of the formula
(2). A rigorous determination of $E$ is impracticable and hence one has to be
content with an approximation. The self-consistent field method is usually
very useful for calculating $E_r$ when the interaction energy is negligible. In
view of this it is difficult to decide whether an electron is bonding, non-bonding
or otherwise especially when the number of valence electrons is large.

The spin serves as a valuable index to the nature of the valence electron.
The part played by the electron in chemical combination was very much
misunderstood in the earlier quantum theories of valence. The bonds were
attributed to the magnetic forces of the spins. But this could not be right as
these forces are comparatively very small. According to the earlier theory
of heteropolar valency as given by Lewis for dissimilar atoms each nucleus
tends to be surrounded by complete shells of electrons as in
an inert gas atom. It was interpreted later as a tendency of the electrons
with opposite spins to form pairs. It was also supposed to explain the
electron-pair bonds of homopolar valency. If the matter were to be decided
merely by the pairing of bachelor electrons of opposite spins both BH₃ and
B₂H₆ should exist. We know, however, that the former is not normally possible.
O₂ is a stable molecule in which all the electrons are not paired and
hence there is a resultant magnetic moment. As against the theory of the
electron-pair bonds there is the example of H₂⁺ in which only one electron is
responsible for the chemical bond. Quantum mechanics has shown that the
electron spin controls the phenomena of chemical combination through the
Pauli exclusion principle. For the choice of the ψ-function upon which the
magnitude of the interaction depends is apparently influenced by the spin.
Heitler and London's method shows why the valency in many cases is equal
to the number of bachelor electrons in the outer shell. The secret of the
importance of the spin lies in the fact that the exchange energy and the spin
interaction can be simultaneously diagonalized.

One expects the number of bachelor electrons in the outer shell of
an atom to be zero or one. Considerations of energy suggest that if there
is an even number of valence electrons they would all be paired if the energy is to be
a minimum. Similarly, when the number of electrons is odd it is expected that only
one of them would be a bachelor electron. The expectation is fulfilled in the
case of lighter atoms. But even in their case it is found that in the adjustment
preceding the formation of a molecule some of the electron pairs may
be divorced. The various possible valencies exhibited by chlorine, which are
1, 3, 5 and 7, are explained by this process. Carbon is known to be quadrivalent.
In the ground state there are only two bachelor electrons in the 2p
shell. In the presence of another atom one pair is usually divorced so that
four electrons with parallel spins appear in the carbon atom.

When two electrons are very close to each other their average interaction
energy is large and the formula (2) fails. Heavy elements with a large
number of valence electrons illustrate the situation. If the Fermi-Dirac
statistics is to be obeyed and if the building-up principle is not to be
violated, two such electrons must move in different spatial orbits. Their
spins may or may not be parallel. But under the influence of an external
electrical field the spins may be made parallel. That is how the large number
of valencies of transition metals, iron and others, are accounted for.

Various quantum mechanical theories have been proposed to remove the
difficulties of the n-body problem which vitiate the theoretical conclusions
regarding chemical combinations. There are the directional and
non-directional theories of Dirac and Slater; the theory of molecular orbitals
first used by Lennard-Jones and that of Heitler and London are also at the
disposal of the research worker. We shall give an account of the latter
theories with reference to the ionized molecule H₂⁺.

We have already referred to bonding, non-bonding and anti-bonding
electrons. The theory of H₂⁺ is essentially one of a 'vagabonding' electron.
The first account of the vagabonding electron appeared in Heisenberg's papers
but now it is recognized to be definitely wrong. Heisenberg's treatment⁸ of
the problem was based on the analogy of two simple pendulums supported
by a horizontal string. If one of the pendulums is set in motion its motion
is gradually transferred to the other so that after some time the first is
brought to rest. The motion is thus completely transferred from one pendulum
to the other at regular intervals. Heisenberg argues that the electron of
H₂⁺ similarly changes its nucleus with a frequency ν. The wave function for
the electron was shown to be

$$\psi = \tfrac{1}{2}\left[\psi^{A}\left(e^{-2\pi i W_1 t/h} + e^{-2\pi i W_2 t/h}\right) + \psi^{B}\left(e^{-2\pi i W_1 t/h} - e^{-2\pi i W_2 t/h}\right)\right] \tag{4}$$

so that the probability density becomes

$$\psi^{*}\psi = |\psi^{A}|^{2}\cos^{2} 2\pi\nu t + |\psi^{B}|^{2}\sin^{2} 2\pi\nu t, \tag{5}$$

where $2h\nu = W_1 - W_2$, and $\psi^{A}$ and $\psi^{B}$ are certain real
functions of spatial position. The coordinates of position measured relative
to the nucleus A appear in $\psi^{A}$ only, and $\psi^{B}$ is similarly
defined for the other nucleus. At $t = 0$ the electron is about the nucleus A
and at $t = 1/4\nu$ it goes over to the field of the other. $W_1$ and $W_2$
are the energies corresponding to the symmetrical and anti-symmetrical wave
functions. This explanation is rejected on the ground that the formation of
H₂⁺ is a static phenomenon and that by the principles of wave mechanics
transitions between symmetrical and anti-symmetrical states cannot happen.
Hence it is wrong to assume the wave function (4) for the valence electron.
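
As a small numerical illustration of a density of the form (5), the sketch
below evaluates the weights cos² 2πνt and sin² 2πνt which multiply the two
spatial distributions. The function name and the choice ν = 1 are assumptions
made only for this illustration; they are not part of Heisenberg's treatment.

```python
import math

def heisenberg_weights(t, nu=1.0):
    """Weights of |psi_A|^2 and |psi_B|^2 in a density of the form (5)."""
    w_a = math.cos(2 * math.pi * nu * t) ** 2
    w_b = math.sin(2 * math.pi * nu * t) ** 2
    return w_a, w_b

if __name__ == "__main__":
    nu = 1.0  # hypothetical frequency, arbitrary units
    for k in range(5):
        t = k / (8 * nu)
        w_a, w_b = heisenberg_weights(t, nu)
        print(f"t = {t:.3f}: weight on A = {w_a:.3f}, weight on B = {w_b:.3f}")
```

On this picture the weight on B first reaches unity a quarter of the period
1/ν after t = 0. The objection of the text is that a stationary state shows no
such dependence on time at all.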

Frenkel has used the word 'collectivization' to describe the relation of
the valence electron to the two nuclei. Every chemical combination is a
collectivization of electrons by the nuclei, which also adjust their relative
positions so that the total energy becomes a minimum. This idea of sharing
electrons is quite old: it appeared in Lewis's theory. But only wave
mechanics has worked it out successfully. We are now going to see how,
in particular, one electron can be shared between the two nuclei.

2. COLLECTIVIZATION OF ELECTRONS

In the method of molecular orbitals we start with the assumption that
the wave function of an atom with n electrons can be expressed as

$$\psi = \psi_1(x_1, y_1, z_1)\,\psi_2(x_2, y_2, z_2) \cdots \psi_n(x_n, y_n, z_n) \tag{6}$$

where $\psi_r(x_r, y_r, z_r)$ is the molecular orbital for the r-th electron.
It is assumed that $\psi_r$ is a solution when other electrons are absent. As
a particular case consider two hydrogen atoms with two valence electrons. If
we call the atoms A and B we may take

$$\psi_1 = a\,\psi_A(x_1, y_1, z_1) + b\,\psi_B(x_1, y_1, z_1), \tag{7}$$

$\psi_A$ and $\psi_B$ being possible wave functions for an electron in the
fields of the nuclei of A and B respectively. If $\psi_A$ and $\psi_B$ are
real and normalized

$$\int \psi_A^{2}\, d\tau = \int \psi_B^{2}\, d\tau = 1, \tag{8}$$

and if $\psi_1$ is also normalized

$$a^{2} + b^{2} + 2abI = 1 \tag{9}$$

where

$$I = \int \psi_A \psi_B\, d\tau. \tag{10}$$

If the other electron is in an equivalent state

$$\psi_2 = a\,\psi_A(x_2, y_2, z_2) + b\,\psi_B(x_2, y_2, z_2), \tag{7'}$$

and

$$\psi = a^{2}\psi_A(1)\psi_A(2) + b^{2}\psi_B(1)\psi_B(2) + ab\left(\psi_A(1)\psi_B(2) + \psi_A(2)\psi_B(1)\right). \tag{11}$$

In the method of collectivization, which was developed by Lennard-Jones,
Mulliken and others, only the first two terms on the right-hand side of (11)
are considered as significant. These terms are called ionic since the
electrons are both in the same atom A or B. It is expected however that the
electrons would avoid each other on account of mutual repulsion. Hence in
taking only the ionic terms, as is done generally in this method, one commits
the mistake of ignoring the repulsion between the electrons. In the Heitler
and London method this mistake is avoided by rejecting the first two terms and
retaining only the coefficient of ab in (11). If the distance between the
atoms is large the rejection of ionic terms provides a good approximation but
when the distances are small the second method breaks down.

Consider two hydrogen nuclei fixed at points $(\pm\tfrac{1}{2}R, 0, 0)$, R
being the distance between them. For an electron in the field of the nuclei
let a possible wave function be

$$\psi(x, y, z)\, e^{-2\pi i E t/h}. \tag{12}$$
As the probability density $|\psi|^{2}$ must be symmetrical with respect to
the protons

$$|\psi(x, y, z)|^{2} = |\psi(-x, y, z)|^{2}. \tag{13}$$

If there is only one electron in the field we have, following (7),

$$\psi(x, y, z) = a\,\psi_0(r_A) + b\,\psi_0(r_B) = a\,\psi_A(x, y, z) + b\,\psi_B(x, y, z) \tag{14}$$

where the vectors $r_A$ and $r_B$, measured from the two nuclei A, B to the
probable position of the electron, can be expressed in terms of x, y, z. By
(13) it is clear that

$$a = \pm b. \tag{15}$$

If $\psi_0$ is normalized and

$$I = \int \psi_0(r_A)\,\psi_0(r_B)\, d\tau, \tag{10'}$$

we may choose for $\psi$ either of the normalized wave functions

$$\psi_s = \frac{\psi_0(r_A) + \psi_0(r_B)}{\{2(1 + I)\}^{1/2}} \tag{16}$$

or

$$\psi_a = \frac{\psi_0(r_A) - \psi_0(r_B)}{\{2(1 - I)\}^{1/2}}. \tag{16'}$$

The wave equation which $\psi$ has to satisfy is

$$\nabla^{2}\psi + \frac{8\pi^{2}m}{h^{2}}(E - V)\,\psi = 0 \tag{17}$$

where

$$V = -\frac{e^{2}}{r_A} - \frac{e^{2}}{r_B}. \tag{18}$$

Since $\psi_0(r_A)$ satisfies

$$\nabla^{2}\psi_0 + \frac{8\pi^{2}m}{h^{2}}\left(E_0 + \frac{e^{2}}{r_A}\right)\psi_0 = 0 \tag{19}$$

and $\psi_0(r_B)$ a similar equation, it follows that (14) is not a rigorous
solution of (17). The condition (15) improves the degree of approximation of
$\psi$. Moreover an exact solution of (17) is too difficult and too
complicated for application.

So far nothing has been said about the choice of the solution of (19).
Let us take the wave function $\psi_{100}$ corresponding to the ground state
of the hydrogen atom,

$$\psi_0(r_A) = \psi_{100} = (\alpha^{3}/\pi)^{1/2}\, e^{-\alpha r_A} \tag{20}$$

where

$$\alpha = 4\pi^{2} m e^{2}/h^{2}. \tag{21}$$
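Using polar coordinates with reference to the midpoint of the nuclei and the
line AB, we have

$$\psi_0(r_A) = (\alpha^{3}/\pi)^{1/2}\, e^{-\alpha(r + \frac{1}{2}R\cos\theta)} \approx (\alpha^{3}/\pi)^{1/2}\, e^{-\alpha r}\left(1 - \tfrac{1}{2}R\alpha\cos\theta\right). \tag{22}$$

It is assumed that R is very small compared to r. So

$$\psi_A(x, y, z) = (\alpha^{3}/\pi)^{1/2}\, e^{-\alpha r}\left(1 - \tfrac{1}{2}R\alpha\cos\theta\right). \tag{23}$$

Similarly we get

$$\psi_0(r_B) = (\alpha^{3}/\pi)^{1/2}\, e^{-\alpha r}\left(1 + \tfrac{1}{2}R\alpha\cos\theta\right) \tag{24}$$

and that gives $\psi_B(x, y, z)$ also. The integral (10') can now be evaluated:

$$I \approx \frac{\alpha^{3}}{\pi}\int e^{-2\alpha r}\left(1 - \tfrac{1}{4}\alpha^{2}R^{2}\cos^{2}\theta\right) d\tau = 1 - \frac{\alpha^{2}R^{2}}{12}. \tag{25}$$
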
The very form of the wave function (12) is that of a static distribution
of probability. Since the energy E of the electron is fixed the uncertainty
principle makes the time indefinite. According to the description of
phenomena by ψ there is, in the present case, no such thing as an electron
having one position or one probability distribution at one time and another
at a different instant. We get a static probability density |ψ|² which gives
rise to the conventional interpretation providing the charge density of the
electron cloud in the form e|ψ|². This is merely a pictorial way of describing
a purely mathematical situation in familiar terms. It has been already
remarked that Schrödinger's first interpretation of ψ failed because the
charge cloud was supposed to possess the properties of an actual charge
distribution as in electrostatics. A warning being given regarding the current
use of the term 'charge cloud' for the expression e|ψ|², we will now show how
it helps us to understand the chemical forces.

When R is very large I becomes negligible and the functions (16) and
(16') give probability distributions such that there are two practically
distinct charge clouds, each about a nucleus. Here is one of the simplest
cases of collectivization, one electron being shared half and half by two
atoms. The cloud about A has the density $\tfrac{1}{2}e|\psi_0(r_A)|^{2}$,
that about B having the density $\tfrac{1}{2}e|\psi_0(r_B)|^{2}$.

The calculation of the force is a very straightforward matter and we will
show how it is done for the symmetrical as well as antisymmetrical wave
functions. In view of the fact that ψ is not precisely determined our solution
is only approximate. It is

$$\psi_s\, e^{-2\pi i E_s t/h} \quad\text{or}\quad \psi_a\, e^{-2\pi i E_a t/h} \tag{26}$$

where $E_s$ and $E_a$ are energy functions for the two states, both being
functions of R. The force is given by $-dE_s/dR$ or $-dE_a/dR$.

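It follows from (17) that

$$E\int |\psi|^{2}\, d\tau = \int V|\psi|^{2}\, d\tau - \frac{h^{2}}{8\pi^{2}m}\int \psi^{*}\nabla^{2}\psi\, d\tau.$$

In the rigorous solution E turns out to be, when use is made of the
normalization condition,

$$E = \int \psi^{*} H \psi\, d\tau, \tag{27}$$

H standing for the operator $V - (h^{2}/8\pi^{2}m)\nabla^{2}$. As we do not
know ψ precisely we get the approximate values

$$E_s = \int V|\psi_s|^{2}\, d\tau - \frac{h^{2}}{8\pi^{2}m}\int \psi_s^{*}\nabla^{2}\psi_s\, d\tau, \tag{28}$$

$$E_a = \int V|\psi_a|^{2}\, d\tau - \frac{h^{2}}{8\pi^{2}m}\int \psi_a^{*}\nabla^{2}\psi_a\, d\tau. \tag{28'}$$

The equations satisfied by $\psi_0(r_A)$ and $\psi_0(r_B)$ run as follows:

$$\nabla^{2}\psi_0(r_A) + \frac{8\pi^{2}m}{h^{2}}(E_0 - V_A)\,\psi_0(r_A) = 0, \tag{29}$$

$$\nabla^{2}\psi_0(r_B) + \frac{8\pi^{2}m}{h^{2}}(E_0 - V_B)\,\psi_0(r_B) = 0, \tag{29'}$$

where $V_A = -e^{2}/r_A$, $V_B = -e^{2}/r_B$ and $E_0$ is the eigenvalue for
the state $\psi_{100}$ of hydrogen. Hence, as $\psi_s$ is real, and including
the energy $e^{2}/R$ of the mutual repulsion of the nuclei,

$$E_s = \int\left(\frac{e^{2}}{R} + V_A + V_B\right)\psi_s^{2}\, d\tau - \frac{h^{2}}{8\pi^{2}m}\int \psi_s\nabla^{2}\psi_s\, d\tau \tag{30}$$

$$\phantom{E_s} = E_0 + \frac{e^{2}}{R} + \frac{M + L}{1 + I} \tag{30'}$$

where

$$M = \int V_A\,|\psi_0(r_B)|^{2}\, d\tau = \int V_B\,|\psi_0(r_A)|^{2}\, d\tau \tag{31}$$

and

$$L = \int V_A\,\psi_0(r_A)\,\psi_0(r_B)\, d\tau = \int V_B\,\psi_0(r_A)\,\psi_0(r_B)\, d\tau. \tag{32}$$

Similarly,

$$E_a = E_0 + \frac{e^{2}}{R} + \frac{M - L}{1 - I}. \tag{30''}$$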

The meaning of M is quite obvious. It represents the electrostatic energy of
the 'part of the electron' at one nucleus in the field of the other. The
exchange integral L represents a typical quantum effect. There is no classical
analogue of it. It brings out the total effect of the interaction between the
parts of the electron in the field of one of the nuclei. The values of L and
M are as follows:

$$L = -e^{2}\alpha\,(1 + \alpha R)\, e^{-\alpha R}, \tag{32'}$$

$$M = -\frac{e^{2}}{R} + \frac{e^{2}}{R}(1 + \alpha R)\, e^{-2\alpha R}. \tag{31'}$$

For evaluating L and M we make use of (20) and (21). The value of I is
found out to be

$$I = e^{-\alpha R}\left[1 + \alpha R + \tfrac{1}{3}(\alpha R)^{2}\right]. \tag{33}$$

When substitutions of these values are made for I, L and M in (30') and
(30'') we get the required functions $E_s$ and $E_a$. It must be noted,
however, that the values of $E_s$ and $E_a$ cannot be considered as reliable
on account of the approximations adopted in the method. The nature of the
$E_s$ - R and $E_a$ - R graphs is all that is significant. $dE_s/dR$ is
positive for some values of R. It becomes zero as R diminishes, becoming
negative for smaller values of R. Hence there exists a critical distance for
which the total energy $E_s$ as a function of the parameter R is a minimum.
The two atoms can come nearer under a mutual attraction but when the critical
distance is reached there is a stable configuration. On the contrary, the
behaviour of $E_a$ - R shows that there is no attraction between the atoms
and that the smaller the value of R the greater is the force of repulsion.
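
The behaviour of the two energy curves can be checked numerically. The sketch
below is only an illustration under stated assumptions: it takes I, M and L in
the standard closed forms for the hydrogen ground state,
I = e^(-x)(1 + x + x²/3), M = -1/x + (1/x)(1 + x)e^(-2x) and
L = -(1 + x)e^(-x), with x = αR and energies measured in units of e²α, and
it inserts them in E = E₀ + e²/R + (M ± L)/(1 ± I). The function names and
the scanning grid are hypothetical choices.

```python
import math

def overlap_I(x):
    """Overlap integral I as a function of x = alpha * R."""
    return math.exp(-x) * (1.0 + x + x * x / 3.0)

def coulomb_M(x):
    """Integral M in units of e^2 * alpha (assumed closed form)."""
    return -1.0 / x + (1.0 / x) * (1.0 + x) * math.exp(-2.0 * x)

def exchange_L(x):
    """Exchange integral L in units of e^2 * alpha (assumed closed form)."""
    return -(1.0 + x) * math.exp(-x)

def energy_s(x):
    """E_s - E_0 for the symmetrical state, units of e^2 * alpha."""
    return 1.0 / x + (coulomb_M(x) + exchange_L(x)) / (1.0 + overlap_I(x))

def energy_a(x):
    """E_a - E_0 for the antisymmetrical state, units of e^2 * alpha."""
    return 1.0 / x + (coulomb_M(x) - exchange_L(x)) / (1.0 - overlap_I(x))

def scan(lo=0.5, hi=10.0, steps=1900):
    """Grid point at which E_s - E_0 is least (a crude search)."""
    grid = [lo + (hi - lo) * k / steps for k in range(steps + 1)]
    return min(grid, key=energy_s)

if __name__ == "__main__":
    x_min = scan()
    print("alpha*R at the minimum of E_s:", round(x_min, 3))
    print("E_s - E_0 at the minimum     :", round(energy_s(x_min), 4))
    for x in (1.0, 2.0, 3.0, 5.0):
        print(x, round(energy_s(x), 4), round(energy_a(x), 4))
```

With these forms the scan places the minimum of the symmetrical curve near
αR = 2.5, while E_a - E₀ stays positive at every sampled distance, in
agreement with the qualitative statements of the text. The numbers are no more
trustworthy than the approximation itself.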

Consider a hydrogen atom and a nucleus of the same element approaching
each other. Suppose further that the relative velocity of the nuclei dies,
on account of collision or radiation, when the two are near enough. If the
electron happens to be in the antisymmetrical state there would be a
repulsion between the atom and the nucleus. If, however, the electron is in
the symmetrical state $\psi_s$ a molecular ion would be formed. Our
calculations are carried out for the ground state $\psi_{100}$. But we can
similarly determine whether a stable ion can be formed when the electron in
the field of either nucleus is originally in some excited state. The problem
has been worked out by several investigators who have examined theoretically
the stability of other states of H₂⁺.

In connection with the one-electron bond of the problem under discussion
Frenkel² gives the analogy of the protoplasmic tie which connects two cells.
When the probability distribution produces most of the charge cloud between
the two nuclei the latter are at least partially shielded from the repulsion
of each other; and it may happen that the force of attraction of the charge
cloud triumphs over that of repulsion, bringing the two nuclei nearer. This
is what happens in the state $\psi_s$ considered above. In the other state the
charge cloud is mostly distributed about the line AB produced both ways.
This only increases the repulsion between the atom and the nucleus. The
bonding electrons are responsible for the intermediate charge cloud, the
antibonding ones going farthest away from the nuclei. Thus in the case of O₂
there are eight electrons from the two atoms participating most conspicuously
in the readjustment of levels. Of these six are bonding and produce the
intermediate charge cloud responsible for the attraction. Two are non-bonding
and their corresponding charge clouds are separated by the two oxygen nuclei.
Usually the non-bonding electrons in pairs are bachelors with parallel spins.
Thus the double bond of oxygen is explained by Herzberg's rule and the
magnetic moment is accounted for by the two bachelor electrons.

We will apply the Heitler-London method to discuss the formation of the
molecule H₂. The two nuclei will be distinguished as before by the suffixes
A and B and the two electrons by the suffixes 1 and 2. Thus $r_{A1}$ is the
vector from the nucleus A to the electron number one. Similarly $r_{B1}$ etc.
are interpreted. Following (11) Heitler and London start with the symmetrical
function

$$\psi_s = \frac{1}{\{2(1 + I^{2})\}^{1/2}}\left[\psi(r_{A1})\,\psi(r_{B2}) + \psi(r_{A2})\,\psi(r_{B1})\right] \tag{34}$$

where the condition of normalization is satisfied when we take

$$I = \int \psi(r_{A1})\,\psi(r_{B1})\, d\tau_1 = \int \psi(r_{A2})\,\psi(r_{B2})\, d\tau_2. \tag{35}$$

It is assumed that $\psi(r_{A1})$ etc. are all normalized. It is definitely
understood that both the electrons at A or B are in the same spatial level so
that their spins are opposite. The very form of $\psi_s$ implies that the
resultant spin is zero. The corresponding antisymmetrical function which
implies parallel spins is

$$\psi_a = \frac{1}{\{2(1 - I^{2})\}^{1/2}}\left[\psi(r_{A1})\,\psi(r_{B2}) - \psi(r_{A2})\,\psi(r_{B1})\right]. \tag{36}$$

Three distinct properties of $\psi_s$ and $\psi_a$ should be noted:

(1) For an interchange of the electrons, that is, for an interchange of
the suffixes 1 and 2, $\psi_s$ remains unaltered while $\psi_a$ changes its
sign. This is the property that we have in mind in calling one wave function
symmetrical and the other antisymmetrical.

(2) If the nuclei are situated symmetrically about the origin on the axis
of x, and if the x-coordinates of both the electrons change their signs,
$\psi_s$ remains unchanged while $\psi_a$ changes its sign. Thus $\psi_s$ has
the property of spatial symmetry with respect to the nuclei; $\psi_a$ has
spatial antisymmetry in relation to them.

(3) An observable α which is symmetric in the spatial coordinates of the
two electrons is always in the state $\psi_s$ or in the state $\psi_a$
provided only these two states are permissible. A transition from one state
to another is not possible since

$$\int \psi_a^{*}\, \alpha\, \psi_s\, d\tau = 0, \tag{37}$$

identically, as the symmetry properties show.





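The kinetic and potential energy of the system of two electrons in the
field of two fixed hydrogen nuclei may be calculated straight from
Schrödinger's equation or, from first principles, as shown below.

The kinetic energy of the electrons, on the classical theory, is

$$T = \tfrac{1}{2}m\left(\dot{x}_1^{2} + \dot{y}_1^{2} + \dot{z}_1^{2}\right) + \tfrac{1}{2}m\left(\dot{x}_2^{2} + \dot{y}_2^{2} + \dot{z}_2^{2}\right) = \frac{1}{2m}\left(p_{x1}^{2} + p_{y1}^{2} + p_{z1}^{2}\right) + \frac{1}{2m}\left(p_{x2}^{2} + p_{y2}^{2} + p_{z2}^{2}\right).$$

Hence the average kinetic energy in the state $\psi_s$ is $\bar{T}$:

$$\bar{T} = \int \psi_s^{*}\, T\, \psi_s\, d\tau = -\frac{h^{2}}{8\pi^{2}m}\int\left(\psi_s^{*}\nabla_1^{2}\psi_s + \psi_s^{*}\nabla_2^{2}\psi_s\right) d\tau \tag{38}$$

where $\nabla_1^{2}$, $\nabla_2^{2}$ have the usual significance for the two
sets of coordinates. Similarly, the average potential energy in the state
$\psi_s$ is $\bar{V}$:

$$\bar{V} = \int \psi_s^{*}\, V\, \psi_s\, d\tau. \tag{39}$$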



These evaluations are carried out by making use of the equations which
$\psi(r_{A1})$ etc. satisfy, viz.,

$$\nabla_1^{2}\psi(r_{A1}) + \frac{8\pi^{2}m}{h^{2}}\left(E_0 + \frac{e^{2}}{r_{A1}}\right)\psi(r_{A1}) = 0, \tag{40}$$

$E_0$ being the characteristic energy for $\psi(r_{A1})$. For the other
functions also $E_0$ is assumed to be the eigenvalue of energy. Let us denote
$\bar{T} + \bar{V}$ by E. There are two values of E, corresponding to the wave
functions, which we will distinguish by proper suffixes as $E_s$ and $E_a$.
$E_s < 2E_0$, $E_a > 2E_0$ within the degree of approximation for R. Hence
$dE_s/dR > 0$ in the domain of interatomic distances with the result that
there is an attraction between the nuclei. If, however, the nuclei are brought
nearer than a certain distance $R_c$ attraction gives way to repulsion. The
antisymmetric state does not favour chemical combination as the $E_a$ - R
graph shows. $2E_0 - E_s$ is the heat of formation. This value cannot be taken
seriously in view of the various approximations. In the first place, the very
form (6) for ψ is inappropriate for quantitative discussion. For if (6) is
true two electrons can come as near to each other as two non-interacting
particles. But we know that each electron carries about it a space-pocket
which is closed to another, on account of mutual repulsion. The exchange
symmetry makes the approximate solution qualitatively valid; the spatial
symmetry permits a collectivization of the two electrons by the nuclei. Hence
the bond is particularly due to the latter.

It is futile to discuss the relative merits and demerits of the two methods
just outlined. When more than two electrons are involved it is necessary to
start with more general assumptions than those of either method. Even when
there are only two valence electrons one may have to modify the method for
dissimilar atoms participating in the chemical combination.

It is possible to explain directional valence by means of the wave
functions of the valence electrons. When the valence electron is an s electron
the wave function has spherical symmetry owing to l = m = 0 and no directional
effect is witnessed. For a p electron there are three kinds of wave
functions corresponding to the possibilities l = 1, m = 0; l = m = 1; and
l = -m = 1. They may be stated as follows:

$$\psi_{n10} = f_n(r)\cos\theta,$$
$$\psi_{n11} = f_n(r)\sin\theta\, e^{i\phi}/\sqrt{2},$$
$$\psi_{n1,-1} = f_n(r)\sin\theta\, e^{-i\phi}/\sqrt{2}.$$

In the absence of an external field the wave functions of Schrödinger's theory
are real and hence in the valence problems appear the wave functions
$f_n(r)\sin\theta\cos\phi$ etc. The result of it is witnessed in the charge
cloud which assumes a dumb-bell shaped form. For the molecular heat to be a
maximum it is necessary that the exchange integral should be maximum. Hence
the dumb-bell distributions of two electrons must overlap as much as
wave-mechanical conditions permit. The directional effect of valence arises
more or less from such overlapping of directed charge clouds.

3. SOME TYPICAL METHODS OF CALCULATIONS 

In the more complicated calculations of valence and in the problem of n
electrons a special mathematical property of the Schrödinger function is of
great use. Let ψ be a normalized wave function so that

$$H\psi = E\psi. \tag{41}$$

In particular let $E_0$ be the value corresponding to the ground state of the
system. Suppose further that ψ' is some other function which satisfies all the
field and boundary conditions, including that of normalization, which ψ
satisfies, except (41). Then it can be shown that

$$E' = \int \psi'^{*} H \psi'\, d\tau \geq E_0. \tag{42}$$

For we can expand ψ' as follows:

$$\psi' = \sum_n a_n \psi_n, \tag{43}$$

where $\psi_1, \psi_2, \ldots$ etc. are a complete set of normalized
orthogonal functions each satisfying a Schrödinger equation of the form

$$H\psi_n = E_n\psi_n. \tag{44}$$

Hence

$$\psi'^{*} H \psi' = \sum_n \sum_{n'} a_n^{*} a_{n'} E_{n'}\, \psi_n^{*}\psi_{n'}$$

and

$$\int \psi'^{*} H \psi'\, d\tau = \sum_n a_n^{*} a_n E_n.$$

Therefore

$$\int \psi'^{*} H \psi'\, d\tau - E_0 = \sum_n a_n^{*} a_n E_n - E_0 = \sum_n a_n^{*} a_n (E_n - E_0) \tag{45}$$

since by the condition of normalization it follows from (43) that

$$\sum_n a_n^{*} a_n = 1. \tag{46}$$

As $E_0$ is the least eigenvalue (42) follows from (45). When ψ is not
precisely known one may try a number of plausible functions in order to get at
an estimate of $E_0$. What is most helpful in these calculations is that even
when ψ' differs from ψ appreciably the corresponding deviation of E' from
$E_0$ is, in most cases, rather small. The method has been successfully
employed to determine the value of energy in the ground state of the normal
helium atom. When the eigenstates are not discrete but continuous an integral
replaces the right-hand side of (43). The procedure is substantially unaltered
for this case. The variational method is known also as the Ritz method and it
is most helpful in practice for calculating the molecular heats.
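
A minimal sketch of the procedure for the helium atom is given below. It is
not taken from the lectures. It assumes the familiar trial function
exp(-Z'(r₁ + r₂)) with one adjustable effective charge Z', for which the
energy expectation in Hartree atomic units is Z'² - 2ZZ' + (5/8)Z' with
Z = 2. The function names and the golden-section search are illustrative
choices only.

```python
def trial_energy(z_eff, z_nuc=2.0):
    """Energy expectation (Hartree units) for the assumed trial function
    exp(-z_eff * (r1 + r2))."""
    kinetic = z_eff ** 2                 # kinetic energy of both electrons
    attraction = -2.0 * z_nuc * z_eff    # attraction of the nucleus
    repulsion = 5.0 * z_eff / 8.0        # mutual repulsion of the electrons
    return kinetic + attraction + repulsion

def minimise(f, lo, hi, tol=1e-10):
    """Golden-section search for the minimum of a unimodal function."""
    g = (5 ** 0.5 - 1) / 2
    a, b = lo, hi
    while b - a > tol:
        c = b - g * (b - a)
        d = a + g * (b - a)
        if f(c) < f(d):
            b = d    # the minimum lies in [a, d]
        else:
            a = c    # the minimum lies in [c, b]
    return 0.5 * (a + b)

if __name__ == "__main__":
    z_best = minimise(trial_energy, 1.0, 2.0)
    print("best effective charge:", z_best)
    print("variational energy   :", trial_energy(z_best))
```

The minimum falls at Z' = 27/16, and the corresponding energy lies above the
accurate ground-state value of about -2.90 Hartree, as (42) requires. A
better trial function can only lower the estimate towards the true value.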

In valence problems and in several theoretical investigations of polyatomic 
molecules the group theory is of great service. Its great advantage is that 
some of the inner structural relations of mathematics are disclosed as funda- 
mental properties of a group without having to solve intricate differential 
equations and plough through devious approximations. Thus the selection 
and polarization rules for polyatomic molecules can be deduced either by 
quantum mechanical methods or by the application of the group theory to 
the classical treatment. It is true of course that there is a limitation on the
results that can be obtained purely by the group theory. To know how the
group methods are applied and how certain interesting relations are obtained
therefrom the reader may find it worthwhile to take up certain very elementary 
problems first. 

Suppose that we want to know the number of persons present at a
meeting. Somebody counts and tells that the number is, say, 45. You question
the accuracy of the calculation and then the counting is done openly in
order to convince you. Even if the counting yields the same result you would
be justified in not accepting the result provided you do not know fundamental
properties of cardinals and ordinals. Actually what happens in most cases
is just the reverse: you do not know the properties and are not even aware
of their existence and hence you usually accept the result of a count on
account of convention and experience. But in doing this you are either
illogical or unconsciously making use of the theory of groups. If a particular
count gives the result, 45, what is the guarantee that a different count would
give the same result? Unless that guarantee is first secured it is not right
to rely upon the result of a particular count. Moreover, if different counts
are to give different results there would be no such thing as a definite
number associated unequivocally with the crowd of persons present. Now it is
possible to establish one-to-one correspondence between all the different
counts (which we know to be 45! in number) so that by means of substitutions
any count can be transformed into any other. Hence it is this fact which helps
us to see that any one accurate count is enough to know the number of persons
present.

As a less trivial example⁷ consider the possible number of regular polygons
of n sides. If n = 7, we see that a cyclic group of order 7, ($S^{7} = 1$),
can be generated by the elements $S, S^{2}, S^{3}, S^{4}, S^{5}$ and $S^{6}$.
There are 6 different generators. We associate with $S$ and $S^{6}$, with
$S^{2}$ and $S^{5}$, and with $S^{3}$ and $S^{4}$ the same polygons but
described in different order. Hence we conclude that the number of polygons
for n = 7 is 3, being half the number of generators. For n = 8, we consider
the generators of the cyclic group of order 8, ($S^{8} = 1$), and we find them
to be $S, S^{3}, S^{5}, S^{7}$. Hence there are only 2 regular polygons for
n = 8. In the general case also it is half the number of generators when
($S^{n} = 1$) (which is the same thing as half the number of numbers less
than n and prime to n) which provides the required answer. The process of
constructing a regular polygon can be shown to consist of certain operations
each of which can be represented by some one of the elements of the cyclic
group ($S^{n} = 1$).
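
The count can be reproduced by a few lines of code. In the sketch below the
function names are hypothetical, and n is assumed to be at least 3; a power
S^k is taken to generate the cyclic group exactly when k is prime to n.

```python
from math import gcd

def generators(n):
    """Exponents k for which S**k generates the cyclic group S**n = 1."""
    return [k for k in range(1, n) if gcd(k, n) == 1]

def regular_polygons(n):
    """Number of regular n-sided polygons, S**k and S**(n-k) being counted
    as the same polygon described in opposite order."""
    return len(generators(n)) // 2

if __name__ == "__main__":
    for n in (5, 7, 8, 12):
        print(n, generators(n), regular_polygons(n))
```

For n = 5 the two polygons are the ordinary pentagon and the five-pointed
star.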

The fundamental postulates⁴ which give an ensemble of elements A, B, C
etc. the character of a group are the following:

(i) Two elements can be combined in a certain order, the result being
an element of the group. Thus AB = C. But it is not generally necessary
that AB = BA, a condition which is satisfied by Abelian groups.

(ii) The associative law holds good so that if A, B and C are any three
elements (AB)C = A(BC).

(iii) There exists a unit element I such that AI = IA = A for any
element A of the group.

(iv) Corresponding to every element A there exists a reciprocal $A^{-1}$
such that $AA^{-1} = A^{-1}A = I$.

When the number of elements of the group is finite, say n, the group
is called finite and of order n. The four numbers ±1, ±i provide an example
of a finite group. In this case the multiplication of elements AB has
the usual arithmetical significance. But in general when A and B stand for
abstract operations we mean by AB the result of two successive operations
done in the order in which the symbols are written. Consider, for example,
the symmetric permutation group of three letters a, b, c which is represented
by the six substitutions

      A_1     A_2     A_3     A_4     A_5     A_6

      abc     bac     acb     cba     bca     cab
      ---     ---     ---     ---     ---     ---
      abc     abc     abc     abc     abc     abc

each upper row being substituted for the lower. Thus $A_1$ is I and
$A_1 A_2 = A_2$. Similarly $A_3 A_4$ means first substituting a, c, b for
a, b, c and then substituting c, a, b for a, c, b. The result is equivalent
to the substitution cab for a, b, c, that is, $A_6$. So $A_3 A_4 = A_6$. This
group is called symmetric for the reason that its operations leave any
symmetric function of a, b, c unchanged in value.
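
The multiplication of substitutions is easily mechanized. The following
sketch is illustrative only: each substitution is written as the string that
replaces abc, the labels A1 to A6 follow the order abc, bac, acb, cba, bca,
cab (an assumed reading of the table), and `compose` is a hypothetical helper
which carries out the first substitution and then the second.

```python
LETTERS = "abc"
GROUP = ["abc", "bac", "acb", "cba", "bca", "cab"]   # A1 ... A6
A1, A2, A3, A4, A5, A6 = GROUP

def compose(first, second):
    """Substitution obtained by carrying out `first` and then `second`.
    The string 'acb' stands for a->a, b->c, c->b."""
    m1 = dict(zip(LETTERS, first))
    m2 = dict(zip(LETTERS, second))
    return "".join(m2[m1[x]] for x in LETTERS)

def is_closed(group):
    """Postulate (i): every product is again an element of the group."""
    return all(compose(p, q) in group for p in group for q in group)

if __name__ == "__main__":
    print("A3 A4 =", compose(A3, A4))   # first acb, then cba
    print("A4 A3 =", compose(A4, A3))
    print("closed:", is_closed(GROUP))
```

The two products differ, which shows that the group is not Abelian.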

A select set of operations of the group may by themselves satisfy all the
postulates of a group and thus constitute a sub-group. Take any element A
of a group. Then $A^{2}, A^{3}, \ldots$ etc. must be also elements of the
group. It is certain that $A, A^{2}, \ldots, A^{r}$, etc. cannot have more
than n distinct values since n is the order of the parent group of which A is
a member. Let $A^{r} = I$. Then it is clear that the higher powers of A will
not give any new elements. Hence the powers of A constitute a sub-group of
order r and it is called a cyclic sub-group.

When the elements of two groups have a one-to-one correspondence they
are said to be isomorphic. Thus if A corresponds to A' and B to B', AB
must correspond to A'B'. Unless the last condition is fulfilled the groups
would not be isomorphic. Further, if one of the groups represents analytical
operations it is said to give a representation of the other group which
consists of purely abstract operations. Such a representation of the cyclic
group, ($A^{r} = I$), is given by the elements $e^{2\pi i p/r}$ where
p = 0, 1, ..., (r - 1).

Let {G} be a group of order n and let {H} be a sub-group of order m.
It is obvious that n/m must be an integer. For consider an element $A_1$ which
does not belong to {H}. Then $HA_1$ are m new elements which do not belong
to {H}. If this is not true either $H_p A_1 = H_q$, that is
$A_1 = H_p^{-1} H_q$, or $H_p A_1 = H_q A_1$, that is, $H_p = H_q$, both of
which are impossible if $H_p$ and $H_q$ are distinct elements of {H} and $A_1$
does not belong to it. Let $A_2$ be an element of {G} which does not belong to
{H} and also to the set $HA_1$. Then $HA_2$ is another set of elements which
can be similarly shown to be new. This process must come to a stop somewhere,
say at $A_{p-1}$. Then all the elements of {G} are the mp elements
$H, HA_1, \ldots, HA_{p-1}$. Hence n = mp. Also it can be shown that, if {H}
is a self-conjugate group,

$$H,\; HA_1,\; HA_2,\; \ldots,\; HA_{p-1} \tag{47}$$

is itself a group of order p. It is usually denoted by {G/H}. What is meant
by conjugate operations will now be explained.

Let A, B, S be any three elements of {G}. If

$$B = SAS^{-1} \tag{48}$$

A and B are said to be conjugate to one another. If A and B are identical
AS = SA. An important property of conjugate operations is that, if
$A^{r} = I$, then

$$B^{r} = (SAS^{-1})^{r} = SA^{r}S^{-1} = I. \tag{49}$$
Now all the elements which commute with any element A form a group.
For if S and T are two such elements of {G}

$$S = ASA^{-1}, \quad T = ATA^{-1}$$

and hence

$$ST = ASA^{-1}ATA^{-1} = ASTA^{-1}.$$

Let us call this group {H}. Then using the result (47) we know that

$$A,\; A_1^{-1}AA_1,\; A_2^{-1}AA_2,\; \ldots,\; A_{p-1}^{-1}AA_{p-1} \tag{50}$$

are all the conjugate elements of which A is one. They constitute what is
called a class and its order p = n/m is an integer. The elements of the group
can be divided into such classes.
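
For the group of substitutions of three letters the division into classes can
be carried out directly. The sketch below is an illustration under the same
hypothetical conventions as before: a substitution is the string which
replaces abc, and products are read in the order in which the symbols are
written. It also checks that the order of a class multiplied by the order of
the sub-group of elements commuting with A gives the order of the group.

```python
LETTERS = "abc"
GROUP = ["abc", "bac", "acb", "cba", "bca", "cab"]

def compose(first, second):
    """Carry out `first` and then `second`."""
    m1 = dict(zip(LETTERS, first))
    m2 = dict(zip(LETTERS, second))
    return "".join(m2[m1[x]] for x in LETTERS)

def inverse(p):
    """Reciprocal substitution: sends each image back to its original."""
    back = dict(zip(p, LETTERS))
    return "".join(back[x] for x in LETTERS)

def conjugacy_class(a):
    """All elements S^-1 A S, S running over the whole group."""
    return sorted({compose(compose(inverse(s), a), s) for s in GROUP})

def commuting_subgroup(a):
    """Elements S of the group for which AS = SA."""
    return [s for s in GROUP if compose(a, s) == compose(s, a)]

def classes():
    """Division of the group into classes of conjugate elements."""
    seen, result = set(), []
    for a in GROUP:
        if a not in seen:
            cls = conjugacy_class(a)
            seen.update(cls)
            result.append(cls)
    return result

if __name__ == "__main__":
    for cls in classes():
        a = cls[0]
        print(cls, "order", len(cls), "commuting sub-group of order",
              len(commuting_subgroup(a)))
```

The six substitutions fall into three classes: the identity, the three
interchanges of two letters and the two cyclic substitutions.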


Consider now a set of substitutions

$$X_r = \sum_s a_{rs} X'_s, \tag{51}$$

which is also expressed as

$$X = AX' \tag{52}$$

where X and X' are n-dimensional vectors with components $X_r$ and $X'_s$ and
A is the matrix $|a_{rs}|$. If

$$X' = BX'', \quad X'' = CX''' \tag{52'}$$

are similar substitutions

$$X = ABX'' \quad\text{and}\quad X = ABCX'''.$$

Here A, B, C, etc. constitute a group. Every finite group is isomorphic with
some group of linear substitutions. Hence matrices can always be associated
with the abstract operations A, B, C etc. Suppose that by choosing a suitable
transformation all the matrices of the group can be made to appear as

$$A = \begin{pmatrix} A_1 & 0 & \cdots & 0 \\ 0 & A_2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \vdots \\ 0 & 0 & \cdots & A_p \end{pmatrix} \tag{53}$$

where $A_1, A_2, \ldots, A_p$ are square matrices. In general when the matrix
$A_r$ is repeated $a_r$ times in such a representation we put

$$A = \sum_{r=1}^{p} a_r A_r. \tag{54}$$

The sum of the diagonal elements of A is called the character and is shown as
χ(A). This has great importance in applications since if $B = SAS^{-1}$,

$$\chi(BS) = \chi(SA)$$

and hence

$$\chi(B) = \chi(SAS^{-1}) = \chi(A). \tag{55}$$
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If A denotes the matrix of an observable and B is its diagonalized form we 
know that the diagonal elements of B give the eigenvalues. The sum of the 
diagonal elements has therefore a physical significance. The orthogonal 
property of wave functions is essentially the result of the orthogonality of 
group characters. The following two properties of the group characters 
are very helpful in applications. 

(1) If the group can be split up into p different classes there are p dis- 
tinct parts into which A can be split up, A h A it . . . etc. so that none of these 
can be reduced further into the form 



where P\ and P 2 are square matrices. The group characters for the parts 
satisfy the condition, 


X(A) - fa, XU, ). 

(56) 

J = 1 


(2) The characters are subject to the relations 


S XU, ) XU,/) - hi# 

(57) 

and . S [X (/> )] s = h. 

(58) 


I s ® 1 


Here $h$ is the order of the group and the summation in (57) is made
over all the operations of the group. In quantum-mechanical studies of
spectra the properties of the symmetric group are extensively illustrated.
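The relations (57) and (58) can be illustrated on the smallest non-commutative case, the symmetric group on three objects, of order 6. The sketch below is not from the original lectures. The character values are the standard ones for this group; the grouping of the six operations into three classes, and all the names used, are conveniences of the sketch. Since these characters are real, no complex conjugation is needed.

```python
# Number of operations in each class: identity, three transpositions,
# two cyclic permutations.
class_sizes = [1, 3, 2]

# One character value per class for each irreducible part.
characters = {
    "symmetric":     [1, 1, 1],
    "antisymmetric": [1, -1, 1],
    "two_dimensional": [2, 0, -1],
}

h = sum(class_sizes)  # order of the group


def character_sum(chi_a, chi_b):
    """Sum of chi_a * chi_b over all operations of the group, as in (57)."""
    return sum(n * a * b for n, a, b in zip(class_sizes, chi_a, chi_b))


names = list(characters)
for j in names:
    for k in names:
        print(j, k, character_sum(characters[j], characters[k]))

# Relation (58): squares of the characters of the identity operation.
sum_of_squares = sum(chi[0] ** 2 for chi in characters.values())
print(sum_of_squares, h)
```

The sums equal the order of the group when the two parts coincide and vanish otherwise, and the squares of the characters of the identity add up to the order as well.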

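One property which we wish to mention last concerns direct products.
Suppose we have two vectors $X$ and $Y$ of dimensions $m$ and $n$ and with
matrices of transformation $A$, $B$:

$$X = AX', \qquad Y = BY'$$

or

$$x_i = \sum_{\alpha} a_{i\alpha}\,x'_{\alpha}, \qquad y_j = \sum_{\beta} b_{j\beta}\,y'_{\beta}. \tag{59}$$

The direct product of $X$ and $Y$ provides a vector $Z$ of dimensions $mn$ and the
law of transformation is

$$Z = CZ' \quad\text{or}\quad x_i y_j = \sum_{\alpha}\sum_{\beta} a_{i\alpha}\,b_{j\beta}\,x'_{\alpha}\,y'_{\beta}. \tag{60}$$

Here the $mn$ components are $x_i y_j$, $i = 1, 2, \ldots, m$, $j = 1, 2, \ldots, n$. Hence

$$\chi(C) = \sum_{i,j} a_{ii}\,b_{jj} = \sum_{i} a_{ii} \sum_{j} b_{jj} = \chi(A)\,\chi(B). \tag{61}$$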
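The rule (61) can be verified by building the matrix $C$ of (60) explicitly. The following sketch is illustrative only; the two matrices and the function names are assumptions of the example, and the components $x_i y_j$ are ordered with $j$ running fastest.

```python
def direct_product(A, B):
    """Matrix C of Z = C Z', with rows labelled (i, j) and columns (alpha, beta).

    The element in row (i, j) and column (alpha, beta) is a[i][alpha] * b[j][beta].
    """
    m, n = len(A), len(B)
    return [[A[i][alpha] * B[j][beta]
             for alpha in range(m) for beta in range(n)]
            for i in range(m) for j in range(n)]


def character(M):
    """Sum of the diagonal elements of a square matrix."""
    return sum(M[i][i] for i in range(len(M)))


# Illustrative matrices of dimensions m = 2 and n = 3.
A = [[1, 2],
     [3, 4]]
B = [[0, 1, 2],
     [1, 1, 0],
     [2, 0, 3]]

C = direct_product(A, B)

print(len(C), character(C), character(A) * character(B))
```

The matrix `C` has $mn = 6$ rows, and its character equals the product of the characters of `A` and `B`.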

From any transformation of the form

$$x' = ax + by$$

$$y' = cx + dy$$

it is possible to determine the law of transformation for a vector whose
components are

$$x^{f},\; x^{f-1}y,\; \ldots,\; x^{f-r}y^{r},\; \ldots,\; y^{f}.$$

Thus the laws of transformation connecting the $(f + 1)$-dimensional space
become known. Similarly we can consider the laws of transformation for
antisymmetric functions of the form

$$x_i y_k - x_k y_i$$

when $x_i$ and $y_k$ are vectors of $n$ dimensions. The matrices of transformation
of antisymmetric tensor components can also be multiplied as in (60) and
the result can be expressed in the reduced form as in (53). This gives the
Clebsch-Gordan series.
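The passage from a two-dimensional substitution to the law of transformation of the $f + 1$ products $x^{f-r}y^{r}$ can be made concrete. The sketch below is not from the original text. It assumes the substitution is read as $x' = ax + by$, $y' = cx + dy$, and it expands each product $x'^{\,f-r}y'^{\,r}$ in the unprimed products; the function names are hypothetical.

```python
def poly_mul(p, q):
    """Multiply two homogeneous polynomials in x and y.

    A polynomial is stored as a list of coefficients indexed by the power of y.
    """
    out = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            out[i + j] += pi * qj
    return out


def induced_matrix(a, b, c, d, f):
    """Row r holds the coefficients of x^(f-s) y^s in (ax+by)^(f-r) (cx+dy)^r."""
    rows = []
    for r in range(f + 1):
        poly = [1]
        for _ in range(f - r):
            poly = poly_mul(poly, [a, b])   # one more factor (ax + by)
        for _ in range(r):
            poly = poly_mul(poly, [c, d])   # one more factor (cx + dy)
        rows.append(poly)
    return rows


# Illustrative coefficients; f = 2 gives a three-dimensional space.
M = induced_matrix(1, 2, 3, 4, 2)
for row in M:
    print(row)
```

For $f = 1$ the function returns the original matrix of coefficients, and for $f = 2$ it gives the law of transformation of $x^2$, $xy$, $y^2$.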

Suppose³ that there are two atoms whose outermost subshells contain
$f_1$ and $f_2$ electrons. We consider the two nuclei as fixed at some distance
apart. The interaction energy $W$ has a definite symmetry for both the atoms
and its laws of transformation give us a matrix $A = A_1 \times A_2$. The matrix
is invariant for all symmetric transformations. This is expanded in the
vector space of electron spin and the irreducible components give us the
valence bonds and the states of symmetry of the molecule. The procedure
can be adopted for more complicated chemical combinations. The numbers
of external electrons corresponding to the states in the expansion, when
compared to $f_1$ and $f_2$, tell us what electrons are bonding or antibonding.

One of the most important applications of groups is in the study of
crystallography. Each crystal is characterized by an expression of elastic
energy quadratic in the six strains and involving twenty-one constants. There 
are thirty-two different groups some one of which leaves the energy expression 
invariant for any crystal. The nature of symmetry in the crystal determines 
the groups which provide relations between the constants. 

4. OTHER FORCES OF QUANTUM-MECHANICAL NATURE 

The subject that comes under the title of this section is very vast. First
there are the forces of cohesion which explain the surface tension phenomena
and latent heats and, as they are responsible for the constant $a$ of
van der Waals’ equation,

$$\left(p + \frac{a}{v^2}\right)(v - b) = RT,$$

they are also referred to as van der Waals’ forces. For smaller distances
the forces of repulsion between atoms are more powerful. They account for
the rigidity of matter and also for the other elastic properties. When the
atoms are brought still nearer together so that there is a partial overlapping
of the electron clouds, again the forces become attractive and formations of
molecules are witnessed. At still smaller distances the forces of repulsion
between the electrons arrest our attention. The Fermi-Dirac statistics of
antisymmetrical states describes the full effect of interactions in large
assemblies of electrons. Once more and for intra-nuclear distances forces of
attraction assert themselves. There is evidence that two protons attract each other
within a nucleus. The nuclear forces are treated as exchange forces and
we will give some account of them. It is not possible to touch here the
topic of the constants of elasticity as that would take us too far into a
discussion of metals and crystal structure. The orthodox treatment⁵ of the van
der Waals forces between two molecules was based on the electrostatic forces
given by the usual expansion for the potential of a charge assembly,

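$$V = \frac{\sum e_i}{R} + \frac{\sum_j P_j X_j}{R^3} + \frac{1}{2}\sum_{j,k}\Theta_{jk}\,\frac{3X_j X_k - R^2\delta_{jk}}{R^5} + \cdots \tag{62}$$

where $P_1 = \sum e_i x_i$, $P_2 = \sum e_i y_i$, $P_3 = \sum e_i z_i$;

$\Theta_{11} = \sum e_i x_i^2$, $\Theta_{12} = \sum e_i x_i y_i$, etc.;

$R^2 = X_1^2 + X_2^2 + X_3^2$;

and $X_1, X_2, X_3$ are the coordinates of the current point, the origin being at
the centre of gravity of the charges. For a neutral molecule $\sum e_i = 0$ and
the first term in the expansion (62) vanishes. At great distances the field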
of the molecule may therefore be that of a dipole if the second term is
non-zero or that of a quadrupole if that term also vanishes. If the third term
is also zero the molecule behaves as an octupole. Equal and opposite charges
separated by a small distance give us a dipole. Alternately equal and opposite
charges at the corners of a parallelogram have the field of a quadrupole.
But if the charges on all the corners of a parallelepiped are so distributed
that they are alternately equal and opposite the field of an octupole is
obtained. The degree of polarity varies from molecule to molecule. We will
give here the explanation of intermolecular forces first given by Heitler and
London on the assumption that the charge distribution of molecules has the
field of a harmonic electrical oscillator. They were guided in this by the
consideration that the energy values of an oscillator account for the energy
of molecules at the absolute zero. For two oscillators⁶ separated by a
distance $R$ and in the same line the Hamiltonian is

$$H = \frac{1}{2m}\left(p_1^2 + p_2^2\right) + \frac{e^2}{2a}\left(x_1^2 + x_2^2\right) + \frac{2e^2 x_1 x_2}{R^3} \tag{63}$$

when higher terms in the expansion of the interaction energy,

$$\frac{e^2}{R} + \frac{e^2}{R + x_1 + x_2} - \frac{e^2}{R + x_1} - \frac{e^2}{R + x_2}, \tag{64}$$

are neglected. If the last term of (63) is absent the oscillators are uncoupled
and the frequency is

$$\nu_0 = \frac{1}{2\pi}\sqrt{\frac{e^2}{ma}}. \tag{65}$$

The constant $a$ in (63) and (65) becomes known when the law of the restoring
force in the oscillators is given. It is a measure of the distortion undergone
by the molecules.

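Using the transformation

$$x_s = \frac{1}{\sqrt{2}}(x_1 + x_2), \qquad x_a = \frac{1}{\sqrt{2}}(x_1 - x_2),$$

$$p_s = \frac{1}{\sqrt{2}}(p_1 + p_2), \qquad p_a = \frac{1}{\sqrt{2}}(p_1 - p_2),$$

we have

$$H = \left\{\frac{p_s^2}{2m} + \frac{e^2}{2}\left(\frac{1}{a} + \frac{2}{R^3}\right)x_s^2\right\} + \left\{\frac{p_a^2}{2m} + \frac{e^2}{2}\left(\frac{1}{a} - \frac{2}{R^3}\right)x_a^2\right\}. \tag{66}$$

This gives us two oscillators with frequencies

$$\nu_s = \frac{1}{2\pi}\left[\frac{e^2}{m}\left(\frac{1}{a} + \frac{2}{R^3}\right)\right]^{1/2}, \qquad \nu_a = \frac{1}{2\pi}\left[\frac{e^2}{m}\left(\frac{1}{a} - \frac{2}{R^3}\right)\right]^{1/2} \tag{67}$$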


and hence the total energy, when the quantum laws are obeyed, is

$$E = h\nu_s\left(n_s + \tfrac{1}{2}\right) + h\nu_a\left(n_a + \tfrac{1}{2}\right) \tag{68}$$

where $n_s$ and $n_a$ are two integers. In the ground state, $n_s = 0$, $n_a = 0$ and

$$E_0 = \tfrac{1}{2}h(\nu_s + \nu_a) \approx h\nu_0\left(1 - \frac{a^2}{2R^6}\right). \tag{69}$$

The force between the molecules is $-dE_0/dR$ when they are in the state $E_0$.
The force is one of attraction and varies as $R^{-7}$. Thus even in the absence
of chemical forces neutral molecules with valencies saturated may be attracted
to each other at low temperatures. Illustrations are furnished by metallic
molecules such as Hg$_2$.
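The approximation in the ground-state energy and the $R^{-7}$ law of the force can be checked numerically. The sketch below is illustrative and not part of the original lectures. It assumes the frequencies $\nu_0\,(1 \pm 2a/R^3)^{1/2}$ for the two coupled oscillators, and it works in units in which $h\nu_0 = 1$ and $a = 1$, so that $R$ is measured in units of $a^{1/3}$; the function names are hypothetical.

```python
import math


def ground_state_energy(R, a=1.0, h_nu0=1.0):
    """Exact E_0 = (1/2) h (nu_s + nu_a) for two coupled oscillators."""
    u = 2.0 * a / R**3          # coupling 2a/R^3; must be less than 1
    return 0.5 * h_nu0 * (math.sqrt(1.0 + u) + math.sqrt(1.0 - u))


def approximate_energy(R, a=1.0, h_nu0=1.0):
    """Leading terms h nu_0 (1 - a^2 / (2 R^6)) of the expansion."""
    return h_nu0 * (1.0 - a**2 / (2.0 * R**6))


def force(R, a=1.0, h_nu0=1.0):
    """Force -dE_0/dR, obtained by differentiating the exact E_0."""
    u = 2.0 * a / R**3
    du_dR = -3.0 * u / R
    dE_dR = 0.25 * h_nu0 * du_dR * (1.0 / math.sqrt(1.0 + u)
                                    - 1.0 / math.sqrt(1.0 - u))
    return -dE_dR


R = 5.0
print(ground_state_energy(R), approximate_energy(R))
print(force(R), force(R) / force(2.0 * R))
```

The force comes out negative, that is, directed towards smaller $R$, and doubling the distance reduces it by a factor close to $2^7 = 128$.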


Various¹⁰ theories of the neutron-proton interaction in a nucleus have
been offered by Heisenberg, Majorana, Wigner and Yukawa. As the method
adopted by Yukawa suggests the coming into existence of particles such as
the heavy electrons observed in the cosmic ray phenomena we will briefly
describe his procedure. It must be pointed out, however, that Eddington
has argued from the standpoint of relativity to show that the particles of
the Yukawa theory are just comparison particles which cannot be identified
with any real particles such as mesotrons or the heavy electrons. Eddington’s
argument is very difficult to follow.





Yukawa’s theory originates from the classical equation of electrodynamics,

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2\right)\phi = 4\pi\rho, \tag{70}$$

$\rho$ being the electric density. The modified equation,

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 + \lambda^2\right)\phi = 4\pi g\rho, \tag{71}$$

is taken, at the start, involving the two constants $\lambda$ and $g$. The solution of
the new equation gives the nuclear potential

$$\phi = g\,\frac{e^{-\lambda r}}{r} \tag{72}$$

at a distance $r$ from a proton or a neutron. It is obvious that $\phi$ falls off
rapidly as $r$ increases; this is as it should be, for an observed characteristic of
all nuclear forces is their short range. Replacing the differential operators by
the energy and momentum symbols in (71) we have

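$$\left(-\frac{E^2}{c^2} + p_x^2 + p_y^2 + p_z^2 + \frac{\lambda^2 h^2}{4\pi^2}\right)\phi = \frac{h^2}{4\pi^2}\,4\pi g\rho. \tag{73}$$

In view of the smallness of the right-hand side term, (73) may be taken as
equivalent to

$$\left(-\frac{E^2}{c^2} + p_x^2 + p_y^2 + p_z^2 + \frac{\lambda^2 h^2}{4\pi^2}\right)\phi = 0. \tag{74}$$

But this is just the wave equation for a particle having energy $E$, momentum
$(p_x, p_y, p_z)$ and mass $M$:

$$M = \frac{\lambda h}{2\pi c}.$$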

From (72) one concludes that $r = 1/\lambda$ may be taken as a good measure of
the range of the nuclear forces. The range is known to be of the order of
$e^2/mc^2$. This gives an estimate of $\lambda$. The mass of the particle is of the order
of $250m$. The emission and absorption of such a particle is the cause of
nuclear interaction. The constant $g$ in (72) takes account of the nuclear field.
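The connection between the range and the mass can be put into numbers. The sketch below is illustrative and not part of the original lectures. It assumes the potential $g\,e^{-\lambda r}/r$ and the relation $M = \lambda h / 2\pi c$ with $\lambda$ equal to the reciprocal of the range, and it uses present-day values of the constants in SI units; the function names are hypothetical. A range equal to the classical electron radius gives about 137 electron masses, and a range of about half that gives a figure near the 250 electron masses quoted.

```python
import math

# Present-day values of the constants (SI units), used only for illustration.
H_PLANCK = 6.62607015e-34       # Planck's constant h, in J s
C_LIGHT = 299792458.0           # velocity of light, in m/s
M_ELECTRON = 9.1093837015e-31   # electron mass m, in kg
R_CLASSICAL = 2.8179403262e-15  # e^2 / (m c^2), in m


def yukawa_potential(r, g=1.0, lam=1.0):
    """Potential g * exp(-lam * r) / r at distance r from the source."""
    return g * math.exp(-lam * r) / r


def mass_ratio(range_m):
    """Mass M = lam * h / (2 pi c), with lam = 1 / range, in electron masses."""
    lam = 1.0 / range_m
    mass = lam * H_PLANCK / (2.0 * math.pi * C_LIGHT)
    return mass / M_ELECTRON


print(mass_ratio(R_CLASSICAL))        # range e^2 / (m c^2)
print(mass_ratio(0.5 * R_CLASSICAL))  # half that range doubles the mass

# The exponential factor cuts the potential off beyond r = 1 / lam.
for r in (0.5, 1.0, 2.0, 4.0):
    print(r, yukawa_potential(r), 1.0 / r)
```

The last loop compares the potential with the Coulomb form $1/r$ in units where $g = \lambda = 1$, showing how quickly the exponential factor suppresses it at distances beyond the range.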

The heavy particle, which may be positive or negative or even neutral
(neutretto), is known to be unstable. The heavy electron rapidly decays
with the emission of radiation into a $\beta$ particle. The conservation of angular
momentum now necessitates the acceptance of the neutrino postulate. For
as the neutron, the proton and the electron have each a spin $\pm\tfrac{1}{2}$, the
transition, neutron → proton, cannot take place with the emission or absorption
of a heavy electron unless another particle, the neutrino, with a spin $\pm\tfrac{1}{2}$
is induced to participate in the phenomenon.


April 15, 1940.
[Addendum: Conscious of many imperfections and lacunae I conclude
the Report of my Lectures at this stage. I should have liked to include a
more thorough treatment of the group theory and its applications and also
of the nuclear phenomena. All that must await another occasion to see
the light of day. V. V. N. (Dec. 20, 1941)]